Techniques for determining settings for a content capture device

ABSTRACT

Provided are methods, systems, and computer-program products for determining one or more settings of a content capture device. In some examples, to determine the one or more settings, luma values of pixels of an image from the content capture device may be identified. Objects in the image and information associated with the objects may also be identified. The information associated with the objects may be divided into categories of the information. And using the objects and the information, a separate weight array for each category of the information may be computed. The separate weight arrays may be combined to create a total weight array to augment the luma values. The augmented luma values may be aggregated to compute a weighted luma average for the image. Based upon a difference of the weighted luma average and a target, the one or more settings may be adjusted.

CROSS-REFERENCES TO RELATED APPLICATIONS

This application is a divisional of U.S. patent application Ser. No. 15/841,043, filed on Dec. 13, 2017, entitled “TECHNIQUES FOR DETERMINING SETTINGS FOR A CONTENT CAPTURE DEVICE,” which is a non-provisional of and claims the benefit of and priority to U.S. Provisional Patent Application No. 62/438,926, filed on Dec. 23, 2016, entitled “METHOD AND SYSTEM FOR DETERMINING EXPOSURE LEVELS,” the disclosures of which are hereby incorporated by reference in their entirety for all purposes.

BACKGROUND

This disclosure generally relates to determining settings (such as an exposure setting) for a content capture device. The exposure setting may relate to an amount of light a sensor of a content capture device receives when content (e.g., an image or a video) is captured. Examples of exposure settings include a shutter speed, an aperture setting, or an International Standards Organization (ISO) speed.

Traditionally, setting exposure was handled by the user. For example, a user would adjust exposure settings to their liking. However, this proved unreliable and often produced suboptimal results.

Today, automatic exposure control (AEC) is a standard feature on cameras. AEC automatically determines exposure settings for an image without user input. Using AEC, a camera may determine an exposure setting for the camera. However, AEC typically measures an amount of light in a field of view, with no reference to what is in the field of view. Therefore, there is a need in the art for improved AEC.

SUMMARY

Provided are techniques for determining one or more settings (e.g., an exposure setting and/or a gain setting) for a content capture device. In some examples, to determine the one or more settings, luma values of pixels of an image from the content capture device may be identified. Objects in the image and information associated with the objects may also be identified. The information associated with the objects may be divided into categories. And, using the objects and the information, a separate weight array for each category of the information may be computed. The separate weight arrays may be combined to create a total weight array to augment the luma values. The augmented luma values may be aggregated to compute a weighted luma average for the image. Based upon a difference of the weighted luma average and a target, the one or more settings may be adjusted.

In other examples, rather than computing a separate weight array for each category of the information, the information associated with each object may be used in a separate learning based model. Combining the output of each learning based model, a total weight array may be created to augment the luma values of an image. The augmented luma values may be aggregated to compute a weighted luma average for the image. Based upon a difference of the weighted luma average and a target, the one or more settings may be adjusted.

In other examples, a weighting model, as described herein, may be used for an object in an image of a scene. The same weighting model may then be used for other objects in other images of the scene such that an image stitcher may combine the image and the other images together to create an optimized image.

Numerous benefits are achieved by way of the present disclosure over conventional techniques. For example, embodiments of the present disclosure provide better exposure of images by concentrating on higher priority objects. The present disclosure also provides for properly exposed images as part of a mixed or augmented reality scene. In some examples, the present disclosure may even learn and adapt priorities assigned to objects in images.

Certain examples allow exposure to dynamically adjust based upon an eye gaze of a user, as the vector of the eye gaze changes. Examples may also dynamically re-order object priorities based upon movement of a user or resizing of a focus reticle in an image.

The present disclosure also allows for an object-based high-dynamic-range (HDR) method in which multiple high priority objects are properly exposed. These and other embodiments of the disclosure, along with many of its advantages and features, are described in more detail in conjunction with the text below and attached figures.

Provided are techniques for updating a setting of a content capture device. For example, a method may include receiving an image captured by the content capture device. In some examples, the image may include a plurality of pixels. In some examples, the image may not be presented to a user. The method may further include identifying a target luma value for the image. In some examples, the target luma value may be determined based upon at least one of an exposure setting or a gain setting of the content capture device.

The method may further include identifying an object in the image, dividing the plurality of pixels of the image into a plurality of pixel groups, and calculating a pixel group luma value for each of the plurality of pixel groups. The method may further include defining a first set of pixel groups not associated with the object, setting weights for each of the pixel groups in the first set of pixel groups, defining a second set of pixel groups associated with the object, and setting weights for each of the pixel groups in the second set of pixel groups. In some examples, the weights for the pixel groups in the second set of pixel groups may be based upon the association between the second set of pixel groups and the object. In some examples, a number of pixels in a first pixel group may be equal to a number of pixels in a second pixel group. In some examples, a first pixel group may be different from a second pixel group.

The method may further include calculating an image luma value using, for each of the plurality of pixel groups, the weight and the pixel group luma value, and computing a difference between the image luma value and a target luma value. The method may further include updating a setting (e.g., gain and/or exposure) of the content capture device based upon the computed difference.

In some examples, the method may further include identifying a second object in the image, defining a third set of pixel groups associated with the second object, and setting weights for each of the pixel groups in the third set of pixel groups. The weights set for the second object may be used when calculating the image luma value.

In some examples, the method may further include identifying additional information associated with the object. In such examples, the additional information may be a category associated with the object, a size of the object, a distance of the object from the content capture device, or a distance that the object is located from a focus reticle of the content capture device. In some examples, the weights for each of the pixel groups in the first set of pixel groups may be based upon the additional information. In some examples, the weights for each of the pixel groups in the first set of pixel groups may be further based upon second additional information. In such examples, the additional information may be different than the second additional information.

As an example, the method may further include identifying additional information associated with the object. The weights for each of the pixel groups in the first set of pixel groups can be based upon the additional information. In another example, the additional information may include a category associated with the object, a size of the object, a distance of the object from the content capture device, or a distance that the object is located from a focus reticle of the content capture device.

In some examples, the method may further include identifying a direction that a user is looking, determining a location on the image that corresponds to the direction the user is looking, and determining a distance that the object is located from the location. In some examples, the weights for each of the pixel groups in the first set of pixel groups may be based upon the additional information. The weights for each of the pixel groups in the first set of pixel groups can be based upon second additional information that is different than the additional information.

For another example, a method may include receiving an image captured by a content capture device. In some examples, the image may include a plurality of pixels. The method may further include identifying a target luma value for the image. In some examples, the target luma value may be based upon a field of view. The method may further include identifying an object in the image, identifying one or more attributes of the object, and calculating a weight for the object using a neural network. In some examples, the neural network may use the one or more attributes as input. In such examples, an attribute of the one or more attributes of the object may include an object priority, an object distance, or an object size. In some examples, the neural network may be a multilayer perceptron. The method may further include dividing the plurality of pixels of the image into a plurality of pixel groups. In some examples, each pixel group of the plurality of pixel groups may be the same size.
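
To make the neural-network variant concrete, the following sketch (in Python, with NumPy) maps an object's attributes to a weight using a tiny multilayer perceptron. The layer sizes, the sigmoid output, and all parameter values are illustrative assumptions, not part of the disclosed method; a real implementation would use trained parameters.

    import numpy as np

    def object_weight(attributes, w1, b1, w2, b2):
        # attributes: [object_priority, object_distance, object_size],
        # assumed pre-normalized to comparable ranges (an assumption).
        hidden = np.maximum(0.0, attributes @ w1 + b1)  # ReLU hidden layer
        raw = hidden @ w2 + b2                          # scalar output
        return 1.0 / (1.0 + np.exp(-raw))               # squash to (0, 1)

    # Hypothetical, untrained parameters for a 3-input, 4-hidden-unit MLP.
    rng = np.random.default_rng(0)
    w1, b1 = rng.normal(size=(3, 4)), np.zeros(4)
    w2, b2 = rng.normal(size=4), 0.0

    # E.g., a high-priority, nearby, medium-size object.
    weight = object_weight(np.array([0.9, 0.2, 0.5]), w1, b1, w2, b2)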

The method may further include defining a first set of pixel groups not associated with the object, and defining a second set of pixel groups associated with the object. The method may further include calculating a pixel group luma value for each pixel group of the second set of pixel groups. The method may further include multiplying the pixel group luma value by the weight to provide a weighted pixel group luma value for each pixel group of the second set of pixel groups.

The method may further include calculating a total luma value for the image. In some examples, the total luma value may include a summation of the weighted pixel group luma values. The method may further include computing a difference between the total luma value and the target luma value and updating a setting of the content capture device based upon the computed difference. In some examples, the setting of the content capture device may be associated with exposure or gain.

The method may further include identifying a second object in the image, identifying one or more second attributes of the second object, defining a third set of pixel groups associated with the second object, and calculating a second weight for the second object using a second neural network. In some examples, the second neural network may use the one or more second attributes as input. The method may further include calculating a second pixel group luma value for each pixel group of the third set of pixel groups. The method may further include multiplying the second pixel group luma value by the second weight to provide a weighted second pixel group luma value for each pixel group of the third set of pixel groups. In some examples, the total luma value may further include a summation of the weighted second pixel group luma values.

For another example, a method may include receiving a first image captured by a content capture device, identifying a first object in the first image, and determining a first update to a first setting of the content capture device. In some examples, the first update may be determined for the first object. In some examples, the first update may be determined using a neural network. The method may further include receiving a second image captured by the content capture device. In some examples, the second image may be captured after the first image. The method may further include identifying a second object in the second image and determining a second update to a second setting of the content capture device. In some examples, the second update may be determined for the second object. In some examples, the first setting and the second setting may be associated with exposure or gain. In some examples, the first setting may be the second setting. In some examples, the first update may be different than the second update. In some examples, the first image and the second image may be in the same field of view. The method may further include performing the first update to the first setting of the content capture device and receiving a third image captured by the content capture device. In some examples, the third image may be captured after the first update is performed. The method may further include performing the second update to the second setting of the content capture device and receiving a fourth image captured by the content capture device. In some examples, the fourth image may be captured after the second update is performed. The method may further include combining the third image and the fourth image into a single image. In some examples, the third image and the fourth image may be combined using an image stitcher.

According to an embodiment of the present invention, a method isprovided. The method includes receiving a first image captured by acontent capture device and identifying a predetermined number ofpriority objects in the first image. The predetermined number is greaterthan or equal to two. The method also includes determining, for each ofthe predetermined number of priority objects, one or more updates forone or more settings of the content capture device. The method furtherincludes iteratively: updating the content capture device using each ofthe one or more updates and receiving the predetermined number of imagescaptured by the content capture device using each of the one or moreupdates. Additionally, the method includes stitching the predeterminednumber of images together to form a composite image.
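
As a structural sketch only, the iterative capture-and-stitch loop might look like the following; apply_settings, capture_image, and stitch_images are hypothetical stand-ins for the device-control and image-stitcher interfaces, which the disclosure does not specify.

    def capture_composite(updates, apply_settings, capture_image, stitch_images):
        # updates: one settings update (e.g., exposure/gain) per priority object.
        images = []
        for update in updates:
            apply_settings(update)          # update the content capture device
            images.append(capture_image())  # capture with the updated settings
        return stitch_images(images)        # stitch into a composite image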

For another example, a method may include receiving an image captured by a content capture device. The image may include a plurality of pixels. The method may further include identifying a target luma value for the image, dividing the plurality of pixels of the image into a plurality of pixel groups, calculating a pixel group luma value for each of the plurality of pixel groups, and identifying a location in the image. The location may correspond to a point where a user is looking in an environment corresponding to the image. In some examples, the location may be identified based upon an image of one or more eyes of the user. In other examples, the location may be identified based upon a direction of a gaze of the user. In other examples, the location may be identified based upon a location of an object identified in the image. The method may further include setting weights for each of the plurality of pixel groups based upon the identified location, calculating an image luma value using, for each of the plurality of pixel groups, the weight and the pixel group luma value, computing a difference between the image luma value and the target luma value, and updating a setting of the content capture device based upon the computed difference. The setting may be related to gain or exposure. In some examples, the method may further include dividing the plurality of pixels of the image into a plurality of patches, where a patch may include one or more pixel groups. In such examples, setting weights may be further based upon a distance from a patch that includes the location. In some examples, setting weights may be further based upon a distance from the location.

For another example, a method may include receiving an image captured by a content capture device, where the image includes a plurality of pixels. The method may further include identifying a target luma value for the image, dividing the plurality of pixels of the image into a plurality of pixel groups, calculating a pixel group luma value for each of the plurality of pixel groups, receiving a depth map corresponding to the image, setting weights for each of the plurality of pixel groups based upon the depth map, calculating an image luma value using, for each of the plurality of pixel groups, the weight and the pixel group luma value, computing a difference between the image luma value and the target luma value, and updating a setting of the content capture device based upon the computed difference. The setting may be related to gain or exposure. In some examples, the depth map may indicate a distance from a point in space for one or more points, where each of the one or more points corresponds to one or more pixels of the image. In some examples, the method may further include capturing the depth map concurrently with image capture. In other examples, the method may further include capturing the depth map before the image is captured, where the depth map is used to set weights for multiple images. In some examples, setting weights may be further based upon data indicating a location of an object from the image. The data indicating the location of the object may be determined by analyzing pixels of the image to identify one or more pixels of the image that match one or more pixels of a stored image of the object.
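
One plausible realization of depth-based weighting is sketched below: each pixel group is weighted by the inverse of its average depth, so nearer groups are weighted more heavily. The inverse-depth falloff and per-group averaging are assumptions; the disclosure leaves the depth-to-weight mapping open.

    import numpy as np

    def depth_weight_array(depth_map, rows, cols):
        # depth_map: 2-D array of per-pixel distances from the device.
        h, w = depth_map.shape
        r_edges = np.linspace(0, h, rows + 1).astype(int)
        c_edges = np.linspace(0, w, cols + 1).astype(int)
        weights = np.zeros((rows, cols))
        for r in range(rows):
            for c in range(cols):
                block = depth_map[r_edges[r]:r_edges[r + 1],
                                  c_edges[c]:c_edges[c + 1]]
                # Inverse-depth weighting: nearer content dominates (assumed).
                weights[r, c] = 1.0 / (1.0 + block.mean())
        return weights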

For another example, a method may include receiving an image captured by a content capture device, where the image includes a plurality of pixels. The method may further include identifying a target luma value for the image, dividing the plurality of pixels of the image into a plurality of pixel groups, and identifying multiple patches in the image. The multiple patches may include a first patch and a second patch, where the first patch includes one or more pixel groups, and where the second patch includes one or more pixel groups different than the one or more pixel groups of the first patch. In some examples, the multiple patches may be identified based upon the plurality of pixels. In some examples, the multiple patches may be identified based upon one or more objects identified in the image, where the first patch includes pixels associated with a first object. The method may further include calculating, using a first model, one or more weights for the first patch and calculating, using a second model, one or more weights for the second patch. In some examples, the first model may be based upon one or more attributes determined for pixels included in the first patch. In such examples, the second model may be based upon one or more attributes determined for pixels included in the second patch, where the one or more attributes associated with the first model are different than the one or more attributes associated with the second model. In some examples, the first model is a neural network based upon one or more attributes determined for pixels included in the first patch. In some examples, the one or more weights for the first patch may be calculated concurrently with the one or more weights for the second patch. In some examples, the first patch may be a different size than the second patch. The method may further include, for each pixel group, calculating a pixel group luma value and multiplying the pixel group luma value by the weight to provide a weighted pixel group luma value. The method may further include calculating a total luma value for the image, where the total luma value includes a summation of the weighted pixel group luma values. The method may further include computing a difference between the total luma value and the target luma value and updating a setting of the content capture device based upon the computed difference. The setting may be related to gain or exposure.
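
A minimal sketch of the per-patch idea follows; the two stand-in models (a flat weight and a distance decay) are assumptions chosen only to show that different patches can be weighted by different models, possibly concurrently.

    def flat_model(groups):
        # First model: a constant weight for every pixel group in the patch.
        return {g: 1.0 for g in groups}

    def decay_model(groups, center):
        # Second model: weight decays with Chebyshev distance from a center group.
        return {g: 1.0 / (1.0 + max(abs(g[0] - center[0]), abs(g[1] - center[1])))
                for g in groups}

    # Hypothetical patches given as sets of (row, col) pixel-group indices.
    patch_a = {(0, 0), (0, 1), (1, 0), (1, 1)}
    patch_b = {(2, 2), (2, 3), (3, 2), (3, 3)}

    weights = {}
    weights.update(flat_model(patch_a))           # the two models could run
    weights.update(decay_model(patch_b, (2, 2)))  # concurrently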

While methods have been described above, it should be recognized that a computer product may include a computer readable medium storing a plurality of instructions for controlling a computer system to perform an operation of any of the methods described above. In addition, a system may include the computer product and one or more processors for executing instructions stored on the computer readable medium. In addition, a system may include means for performing any of the methods described above. In addition, a system may be configured to perform any of the methods described above. In addition, a system may include modules that respectively perform the steps of any of the methods described above.

This summary is not intended to identify key or essential features of the claimed subject matter, nor is it intended to be used in isolation to determine the scope of the claimed subject matter. The subject matter should be understood by reference to appropriate portions of the entire specification of this patent, any or all drawings, and each claim.

The foregoing, together with other features and examples, will be described in more detail below in the following specification, claims, and accompanying drawings.

BRIEF DESCRIPTION OF THE DRAWINGS

Illustrative embodiments are described in detail below with reference to the following figures.

FIG. 1A illustrates an example of a process for updating one or more settings of a content capture device using automatic exposure control.

FIG. 1B illustrates an example of a process for determining how to update one or more settings of a content capture device.

FIG. 2 illustrates examples of various metering techniques for weighting luma values.

FIG. 3 illustrates an example of a priority weight array for an object.

FIG. 4 illustrates an example of a priority weight array for multiple objects.

FIG. 5 illustrates an example of a focus reticle weight array.

FIG. 6 illustrates an example of an eye gaze weight array.

FIG. 7 illustrates an example of a normalized total weight array.

FIG. 8 is a flowchart illustrating an embodiment of a process for automatic exposure control using a first weighting model.

FIG. 9 illustrates an example of a first part of a second weighting model that may be used for automatic exposure control.

FIG. 10 illustrates an example of a second part of a second weighting model that may be used for automatic exposure control.

FIG. 11 is a flowchart illustrating an embodiment of a process for automatic exposure control using a second weighting model.

FIG. 12A illustrates an example of an image stitching process that may use multiple instances of automatic exposure control.

FIG. 12B illustrates another example of an image stitching process that may use multiple instances of automatic exposure control.

FIG. 12C is a flowchart illustrating an embodiment of a process for using multiple instances of automatic exposure control.

FIG. 13 illustrates an example of an image stream that may be used with the image stitching process.

FIG. 14 is a flowchart illustrating an embodiment of a process for automatic exposure control using an image stitching process.

FIG. 15 illustrates an example of a block diagram for a computer system.

FIG. 16 is a flowchart illustrating an embodiment of a process for automatic exposure control using a location identified based upon a gaze of a user.

FIG. 17 is a flowchart illustrating an embodiment of a process for automatic exposure control using a depth map.

FIG. 18 is a flowchart illustrating an embodiment of a process for automatic exposure control using multiple models.

DETAILED DESCRIPTION

In the following description, for the purposes of explanation, specific details are set forth in order to provide a thorough understanding of embodiments of this disclosure. However, it will be apparent that various embodiments may be practiced without these specific details. The figures and description are not intended to be restrictive.

The ensuing description provides exemplary embodiments only, and is not intended to limit the scope, applicability, or configuration of this disclosure. Rather, the ensuing description of the exemplary embodiments will provide those skilled in the art with an enabling description for implementing an exemplary embodiment. For example, while the description might describe pixel information, images, and/or displaying, it should be recognized that audio may be generated and presented to a user by an augmented reality device instead of or in addition to visual content. It should also be understood that various changes may be made in the function and arrangement of elements without departing from the spirit and scope of the disclosure as set forth in the appended claims.

This disclosure generally relates to determining an exposure setting for a content capture device. The exposure setting may relate to an amount of light a sensor of a content capture device receives when content (e.g., an image or a video) is captured. Examples of exposure settings include a shutter speed, an aperture setting, or an International Standards Organization (ISO) speed.

Traditionally, setting exposure was handled by the user. For example, a user would adjust exposure settings to their liking. However, this proved unreliable and often produced suboptimal results.

Today, automatic exposure control (AEC) is a standard feature on cameras. AEC automatically determines exposure settings for an image without user input. Using AEC, a camera may determine an exposure setting for the camera. In some examples, AEC may be run in conjunction with an auto focus control (AF) and/or an auto white balance control (AWB) for a field of view. In such examples, the AEC may be initially used to compute an estimate for an amount of exposure for the field of view. After the estimate is computed, the AF may execute to determine an amount of focus for the field of view. In some examples, after the amount of focus is determined, the AEC may continue executing to fine tune the exposure settings for the field of view. In some examples, the AWB may execute at least partially in parallel with the AEC. In such examples, the AWB may finish before or after the AF. In some examples, the AWB may begin executing after the AF is finished. While a field of view is described above, it should be recognized that a scene captured in an image may constitute a field of view.

Described further below are techniques for determining one or more settings (e.g., an exposure setting and/or a gain setting) for a content capture device. In some examples, to determine the one or more settings, luma values of pixels of an image from the content capture device may be identified. Objects in the image and information associated with the objects may also be identified. The information associated with the objects may be divided into categories. And, using the objects and the information, a separate weight array for each category of the information may be computed. The separate weight arrays may be combined to create a total weight array to augment the luma values. The augmented luma values may be aggregated to compute a weighted luma average for the image. Based upon a difference of the weighted luma average and a target, the one or more settings may be adjusted.

In other examples, rather than computing a separate weight array for each category of the information, the information associated with each object may be used in a separate learning based model. Combining the output of each learning based model, a total weight array may be created to augment the luma values of an image. The augmented luma values may be aggregated to compute a weighted luma average for the image. Based upon a difference of the weighted luma average and a target, the one or more settings may be adjusted.

In other examples, a weighting model, as described herein, may be used for an object in an image of a scene. The same weighting model may then be used for other objects in other images of the scene such that an image stitcher may combine the image and the other images together to create an optimized image.

FIG. 1A illustrates an example of a process 100 for updating one or more settings of a content capture device using automatic exposure control (AEC). In some examples, the one or more settings may include an exposure setting, a gain setting, or any combination thereof. In such examples, the exposure setting may be a shutter speed, an ISO speed, or any combination thereof. The gain setting may be digital gain, analog gain, or any combination thereof.

The process 100 may include receiving an image (110). In some examples, the image may be received from a sensor of the content capture device. In other examples, the image may be included in a feed that is supplied to the process 100. It should be recognized that the image may be received from a number of devices and systems.

In some examples, the image may be received at an image buffer. In such examples, the image may be a bit-mapped image that includes pixel information for each pixel of the image. In some examples, the image buffer may be a size equal to a pixel height and pixel width of the image. To illustrate, the image buffer size may be 640×480, where 640 may correspond to a width in number of pixels of the image, and 480 may correspond to a height in number of pixels of the image.

The process 100 may further include dividing the image into pixel groups (120). A size and shape of each pixel group may be predefined. In some examples, the size and shape of each pixel group may be the same or vary. For illustration purposes, the pixel groups will be described as rectangles. However, it should be recognized that the pixel groups may be of any shape that divides the image into a plurality of portions. For example, the pixel groups may be radial from a center of the image. In such an example, each pixel group may include a different range of diameters (e.g., a first radial may be from 0 to 1 units from the center, a second radial may be from 1 to 2 units from the center, and a third radial may be from 2 to 3 units from the center). For another example, the pixel groups may be associated with each object (e.g., a first object may be a first pixel group, a second object may be a second pixel group, and the rest of the image may be a third pixel group). It should also be recognized that the pixel groups may be in any other form that divides the image into a plurality of portions. In some examples, a pixel group may be arranged such that two or more pixel groups are overlapping.

In one illustrative example, the image may be divided into 96 pixel groups (12 pixel groups×8 pixel groups, where 12 corresponds to the number of pixel groups along a width of the image and 8 corresponds to the number of pixel groups along a height of the image). In such an illustration, having 12×8 pixel groups in a 640×480 image would mean that each pixel group would have a width of approximately 53 pixels (640/12) and a height of 60 pixels (480/8). While this illustration indicates that the width and height of each pixel group would be different, it should be appreciated that the width of a pixel group may be the same as the height of the pixel group.

The process 100 may further include computing average luma pixel group values for each pixel group of the image (130). In some examples, an average luma pixel group value may be computed by accumulating luma values for each pixel of a pixel group. In such examples, luma values may represent the brightness of an image (e.g., an achromatic portion of an image). In some examples, a luma value may be a representation of an image without a color component. For example, in a YUV colorspace, a luma value may be the Y. In some examples, a luma value is a weighted sum of gamma-compressed RGB components of an image. In such examples, the luma value may be referred to as gamma-corrected luma. In some examples, accumulation may be performed by software or hardware by adding up luma values for each pixel of the pixel group. Once the luma values for a pixel group are accumulated, the total number may be divided by the number of pixels in the pixel group to compute an average luma pixel group value for the pixel group. This process may be repeated for each pixel group in the image.
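
For the 640×480 image and 12×8 grid in the illustration above, the division and per-group averaging might be implemented as follows. Splitting on computed group boundaries, so that edge groups absorb any remainder (since 640 does not divide evenly by 12), is one reasonable choice among several.

    import numpy as np

    def average_luma_groups(luma, groups_x=12, groups_y=8):
        # luma: 2-D array of per-pixel luma values, e.g., shape (480, 640).
        h, w = luma.shape
        r_edges = np.linspace(0, h, groups_y + 1).astype(int)
        c_edges = np.linspace(0, w, groups_x + 1).astype(int)
        avg = np.zeros((groups_y, groups_x))
        for r in range(groups_y):
            for c in range(groups_x):
                block = luma[r_edges[r]:r_edges[r + 1],
                             c_edges[c]:c_edges[c + 1]]
                avg[r, c] = block.mean()  # accumulate, then divide per group
        return avg

    # E.g., an 8 x 12 array of average luma pixel group values.
    luma = np.random.default_rng(1).integers(0, 256, size=(480, 640))
    avg_groups = average_luma_groups(luma)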

The process 100 may further include performing AEC (140). The AEC may take as input the average luma pixel group values for each pixel group of the image. In some examples, an AEC method may apply weights to the average luma pixel group values described above using a weight array. In such examples, the AEC may also take as input the weight array, which may identify a weight to apply to each pixel group of the image.

In some examples, a weight array may include pixel groups that correspond to the pixel groups created by dividing the image. For example, if the image is divided into 25 pixel groups (5 pixel groups×5 pixel groups), the weight array may include weights for 25 pixel groups (5 pixel groups×5 pixel groups). In such an example, the top left most pixel group in the image may correspond to the top left most pixel group in the weight array, and so on. In some examples, values for each pixel group in the weight array may be based upon a number of techniques discussed herein, including metering, object priority, focus reticle, eye gaze, normalized total weight array, learning based methods, the like, or any combination thereof.

In some examples, a weight array (e.g., a weight array described in FIGS. 1-7 and 10) may be combined with the average luma pixel group values to compute weighted luma pixel groups. For example, each average luma pixel group value may be multiplied by a corresponding weight. In other examples, a weight array may be combined with luma values of pixels based upon pixel groups created. In such examples, a weight to apply to a pixel may be determined based upon a pixel group that includes the pixel. For example, if a pixel is in the top left pixel group in the image, a weight associated with the top left pixel group in the weight array may be applied to the pixel. In some examples, a weight associated with a pixel group may be multiplied by each pixel in a corresponding pixel group of the image to compute a weighted luma value for each pixel of the corresponding pixel group.

In some examples, the weighted luma values may be averaged together to create a weighted luma average for an image. In some examples, the weighted luma average may be expressed as:

${{W{LA}} = \frac{\sum\limits_{r = 0}^{M - 1}{\sum\limits_{c = 0}^{N - 1}{{w\left\lbrack {r,c} \right\rbrack} \cdot {{luma}_{avg}\left\lbrack {r,c} \right\rbrack}}}}{\underset{r = 0}{\sum\limits^{M - 1}}{\overset{N - 1}{\sum\limits_{c = 0}}{w\left\lbrack {r,c} \right\rbrack}}}},$

where WLA is the weighted luma average, M is the number of pixel group rows, N is the number of pixel group columns, w[r,c] is the weight at row r and column c of the weight array, and luma_avg[r,c] is the average luma value of the pixel group at row r and column c.
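
In code, the weighted luma average is simply a weighted mean over the pixel-group grid. The sketch below assumes w and luma_avg are equal-shaped arrays such as those produced above, and that at least one weight is nonzero.

    import numpy as np

    def weighted_luma_average(w, luma_avg):
        # WLA = sum(w * luma_avg) / sum(w), per the formula above.
        return float(np.sum(w * luma_avg) / np.sum(w))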

In other examples, the weight array may be used for local tone mapping. For example, a local tone mapping system may use the weight array to identify portions of a field of view that should be brightened. This technique may deal with portions of the field of view, rather than the entire field of view as would some examples of the averaging techniques described above. In some examples, a setting (e.g., an exposure setting and/or a gain setting) may be applied to one or more portions based upon the weight array. In such examples, the weight array may be used as a guide for the local tone mapping.

In some examples, the local tone mapping may be performed on pixels that are above a predefined threshold, where the threshold corresponds to a weight of the weight array. For example, a pixel that is given a weight above the threshold may have the local tone mapping system determine an adjustment of a setting (e.g., an exposure setting and/or a gain setting) for the pixel. In some examples, the pixel and one or more neighboring pixels may be used when comparing to the threshold. For example, the pixel and the one or more neighboring pixels would have to be above the threshold for the local tone mapping to apply to the pixel and/or the one or more neighboring pixels. In some examples, local tone mapping may be supported in software and/or hardware.

In some examples, the process 100 may further include updating one or more settings. As described above, the one or more settings may include an exposure setting, a gain setting, or any combination thereof. In such examples, the exposure setting may be a shutter speed, an ISO speed, or any combination thereof. In some examples, the shutter speed may be a global shutter or a rolling shutter. The global shutter may indicate a duration of time to expose all pixels of a field of view. The rolling shutter may indicate a duration of time to expose a row (either horizontally or vertically) of a field of view. In the rolling shutter, lines of an image may be scanned in a rolling manner rather than a snapshot of the field of view. The gain setting may be digital gain, analog gain, or any combination thereof. For example, a gain setting may be 8× by having an analog gain of 2× and a digital gain of 4×. In some examples, the exposure setting may be adjusted before the gain setting when increasing exposure. In such examples, the gain setting may be adjusted before the exposure setting when decreasing exposure.

FIG. 1B illustrates an example of a process 160 for determining how to update one or more settings of a content capture device. The process may include comparing a target luma value to a luma value (e.g., a weighted luma average) (step 170).

If the weighted luma average is less than the target luma average by a predefined first threshold, the one or more settings may be adjusted to make an image brighter (step 172). For example, if an exposure setting of the one or more settings is not at a maximum allowed exposure setting, the exposure setting may be increased (e.g., incrementally increased) up to the maximum allowed exposure setting (steps 174 and 176). In one illustrative example, the maximum allowed exposure setting may be 16.6 ms for a frame rate of 30 frames per second. However, it should be recognized that the maximum allowed exposure setting could be different, even for a frame rate of 30 frames per second. The maximum allowed exposure setting may be based upon the ISO speed and the content capture device. In some examples, the software and/or hardware of the content capture device may determine a maximum allowed exposure setting (e.g., the maximum allowed exposure setting may be less than the frame period (1/frame rate) minus a time to transfer an image between a sensor and a host processor). In some examples, the maximum allowed exposure setting may be less than what the software and/or hardware allows.

If the exposure setting is at the maximum allowed exposure setting, a gain setting of the one or more settings may be increased (e.g., incrementally increased) up to a maximum allowed gain setting (steps 174 and 178). In one illustrative example, the maximum allowed gain setting may be 8×. However, it should be recognized that the maximum allowed gain setting could be different. The maximum allowed gain setting may be based upon an image quality desired (e.g., noise in an image may increase as the gain setting increases) and software and/or hardware of the content capture device (e.g., a sensor may support up to a certain gain setting).

If the weighted luma average is more than the target luma average by a predefined second threshold (which may be the same or different than the first threshold), the one or more settings may be adjusted to make the image darker (step 180). For example, if the gain setting is not at a minimum allowed gain setting (e.g., 1×), the gain setting may be decreased (e.g., incrementally decreased) down to the minimum allowed gain setting (steps 182 and 184). In some examples, the minimum allowed gain setting may be determined based upon software and/or hardware of the content capture device (e.g., a sensor of the content capture device).

If the gain setting is at the minimum allowed gain setting, the exposure setting may be decreased (e.g., incrementally decreased) down to a minimum allowed exposure setting (e.g., 20 μsec) (steps 182 and 186). The minimum allowed exposure may correspond to an amount of time that a field of view should be exposed based upon software and/or hardware of the content capture device (e.g., a sensor of the content capture device). The amount of adjustment for either situation may be based upon an amount of difference that the weighted luma average is from the target luma average.
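
The branching of process 160 can be summarized in code as below. The limits echo the illustrative numbers above (16.6 ms maximum exposure, 20 μsec minimum exposure, 8× maximum gain, 1× minimum gain); the fixed step sizes and threshold are assumptions, since the text scales the adjustment with the size of the luma difference.

    def update_settings(wla, target, exposure_ms, gain,
                        threshold=2.0, exposure_step=1.0, gain_step=0.25,
                        max_exposure_ms=16.6, min_exposure_ms=0.020,
                        max_gain=8.0, min_gain=1.0):
        # Brighten: raise exposure first, then gain (steps 172-178).
        if target - wla > threshold:
            if exposure_ms < max_exposure_ms:
                exposure_ms = min(max_exposure_ms, exposure_ms + exposure_step)
            else:
                gain = min(max_gain, gain + gain_step)
        # Darken: lower gain first, then exposure (steps 180-186).
        elif wla - target > threshold:
            if gain > min_gain:
                gain = max(min_gain, gain - gain_step)
            else:
                exposure_ms = max(min_exposure_ms, exposure_ms - exposure_step)
        return exposure_ms, gain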

In some examples, the target luma average may be provided. In such examples, the target luma average may be provided and used until a new target luma average is provided. In other examples, a plurality of target luma averages may be provided for different situations. For example, the target luma average may be slightly higher outdoors than indoors because it may be brighter outdoors. Accordingly, in some examples, two different target luma averages may be provided or determined: one for indoor and one for outdoor. In other examples, the target luma average may be determined. In such examples, a current physical environment may be determined based upon one or more sensors. For example, the one or more sensors may detect an amount of light in the current physical environment. In some examples, any combination of the one or more settings may be used to determine the target luma value.

FIG. 2 illustrates examples of various metering techniques for weighting luma values. For the various metering techniques, a weight scale 210 has been provided to be a key for particular weights. Each level of the weight scale 210 includes a color that corresponds to a particular weight when included in a weight array. For example, the top box in the weight scale 210 is the darkest color and represents a weight of 1 and the bottom box in the weight scale 210 is the lightest color and represents a weight of 0. While the weight scale 210 appears to linearly change from the top box to the bottom box, it should be appreciated that the weight scale 210 may be different as long as the weight scale 210 is consistent throughout a single image. In some examples, two or more of the metering techniques described below may be combined in a single weight array.

A first metering technique may be spot metering. In some examples, spot metering may refer to giving the same weight to each pixel of a selected region of an image. In such examples, the spot metering may be implemented to give diminishing weight moving further from the selected region. In some examples, the selected region may be identified based upon a motion of a user (e.g., by making a motion of a screen touch in a field of view). In such examples, a finger moving from a first position to a second position in midair could correspond to selecting a region. While there would not be an actual screen that the user is touching, the motion itself may be detected. Another way that a region may be selected would be by putting fingers together in a way that roughly creates a circle, and everything inside of the circle would be the selection. In other examples, a virtual frame may appear that may be moved by a user. In such examples, a circle may be shown in a display of a user that may be moved and/or resized.

Spot metering may cause a first weight to be assigned to one or more first pixel groups and a second weight to be assigned to one or more second pixel groups. Typically, the one or more first pixel groups are identified based upon a user selection. For example, a user may put a finger on a screen to indicate a spot. The location of the finger may define the one or more first pixel groups to be used for spot metering. A result of spot metering may be a spot metering weight array 220. As may be seen, the spot metering weight array 220 includes a spot 222. The spot 222 may be a location identified based upon a user selection (e.g., the one or more first pixel groups). The spot 222 includes a first weight, which appears to be 1. The rest of the spot metering weight array 220 (e.g., the one or more second pixel groups) includes a second weight, which appears to be 0. It should be appreciated that different weights may be used. It should also be appreciated that more than one spot may be identified.

A second metering technique may be center metering. In some examples, center metering may refer to giving larger weights to groups of pixels at a center of an image and a diminishing weight moving toward edges of the image.

Center metering may cause a plurality of weights to be assigned to pixel groups based upon a distance from a point (e.g., a center) of a weight array. A result of center metering may be a center metering weight array 230. The center metering weight array 230 may include a first weight for a particular distance away from the center. The first weight may be the largest weight. In the center metering weight array 230, the first weight is included in two pixel groups vertically from the center and three pixel groups horizontally from the center. This illustrates that the distance from the center may be different horizontally and vertically. It should also be appreciated that the center metering may also vary in other ways, including diagonally.

The center metering weight array 230 may also include a second, a third, a fourth, and a fifth weight, each of which are one pixel group away from a previous weight. In some examples, each successive level of weight may decrease in weight. For example, the fifth weight may be less than the fourth weight, which may be less than the third weight, which may be less than the second weight, which may be less than the first weight. Again, it should be appreciated that the center metering weight array is just an example, and that other configurations of each level of weight and the particular weights may be used.

A third metering technique may be image metering. Image metering may cause a single weight to be assigned to every pixel group of a weight array. A result of image metering may be an image metering weight array 240. The image metering weight array 240 includes a single weight for every pixel group. Image metering may produce an average exposure for an entire scene.
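
The three metering patterns can be expressed as weight arrays roughly as follows; the Chebyshev-distance decay for center metering and the particular decay rate are assumptions chosen for illustration.

    import numpy as np

    def image_metering(rows, cols):
        # A single weight for every pixel group.
        return np.ones((rows, cols))

    def spot_metering(rows, cols, spot):
        # Weight 1 for the user-selected pixel groups, 0 elsewhere.
        w = np.zeros((rows, cols))
        for r, c in spot:
            w[r, c] = 1.0
        return w

    def center_metering(rows, cols, decay=0.25):
        # Largest weight at the center, diminishing toward the edges.
        r0, c0 = (rows - 1) / 2.0, (cols - 1) / 2.0
        rr, cc = np.mgrid[0:rows, 0:cols]
        dist = np.maximum(np.abs(rr - r0), np.abs(cc - c0))
        return np.clip(1.0 - decay * dist, 0.0, 1.0)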

In some examples, in addition to (or instead of) metering, one or more objects in an image may be assigned a priority that affects a weight array for the image. By assigning weights for the weight array based upon the one or more objects, the one or more objects may remain properly exposed throughout multiple images even if the one or more objects are moving in relation to the image and the weight array. The one or more objects may be properly exposed through multiple images because weights are not being applied to a single position every time, but rather change based upon the one or more objects.

FIG. 3 illustrates an example of a priority weight array 320 for an object in an image. Similar to above, a weight scale 310 has been provided to be a key for particular weights. In some examples, the object in the image may be identified by an object recognition system. In such examples, the object recognition system may identify a category of an object (e.g., a person, an animal, a chair, or the like). In other examples, the identification may identify one or more additional details of an object (in addition to the category of an object). For example, an identity of a person may be identified, an identification of an animal (e.g., this is a golden retriever or this is Spot), or the like. The object recognition system may also determine pixel groups that include the object.

In one illustrative example, a table as illustrated below may be used to assign a weight to an identified object. Using such a table, when an object is identified, the one or more pixel groups identified for the object are assigned the corresponding priority weight from the table.

Object                  Priority    Priority Weight
Active Object           1           1.0
User Selected Object    2           0.95
Identified People       3           0.9
People                  4           0.85
Identified Pets         5           0.8
Pets                    6           0.75
Cars                    7           0.70
Flowers                 8           0.65
Buildings               9           0.60
Insects                 10          0.55
Trees/Shrubs            11          0.50
Artwork                 12          0.45
Furniture               13          0.40
Other                   14          0.35
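
Treating the table above as a lookup, a priority weight array might be filled in as sketched below; the detection format (a category plus the pixel groups the object occupies) is an assumed interface to the object recognition system.

    import numpy as np

    # Priority weights from the table above.
    PRIORITY_WEIGHTS = {
        "active object": 1.0, "user selected object": 0.95,
        "identified people": 0.9, "people": 0.85,
        "identified pets": 0.8, "pets": 0.75,
        "cars": 0.70, "flowers": 0.65, "buildings": 0.60,
        "insects": 0.55, "trees/shrubs": 0.50, "artwork": 0.45,
        "furniture": 0.40,
    }
    OTHER_WEIGHT = 0.35  # pixel groups containing no prioritized object

    def priority_weight_array(rows, cols, detections):
        # detections: iterable of (category, [(row, col), ...]) pairs.
        w = np.full((rows, cols), OTHER_WEIGHT)
        for category, groups in detections:
            for r, c in groups:
                # Keep the highest-priority weight if objects overlap.
                w[r, c] = max(w[r, c], PRIORITY_WEIGHTS[category])
        return w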

In some examples, an active object (as indicated in the table above) may represent an object that is actively in use by an application. For example, a surface may be in use by a mixed reality application. In such an example, a character may be dancing on the surface. To ensure that the surface is a priority, weights associated with the surface may be greater than weights for other areas of an image.

In some examples, a user-selected object (as indicated in the table above) may represent an object that has been selected by a user, similar to the spot metering described above. However, unlike spot metering, the user-selected object may include all pixel groups that the object is included in rather than just the pixel groups that a finger covers.

In some examples, an identified person (as indicated in the table above) may represent a person in an image that has been identified by a face recognition system (or other identification system). For example, the face recognition system may recognize a person as Clark Kent. In such an example, Clark Kent may be included in a list of identified people that should be assigned a higher weight. In addition, in some examples, particular people may be assigned different weights. In some examples, people not included in the list, and/or people not identified by name, may be given a weight different than identified people (such as indicated by “people” in the above table).

Similarly, an identified pet (as indicated in the table above) may represent a pet in an image that has been identified by a face recognition system (or other identification system). For example, the face recognition system may recognize a pet as Dixie. In such an example, Dixie may be included in a list of identified pets that should be assigned a higher weight. In addition, in some examples, particular pets may be assigned different weights. In some examples, pets not included in the list, and/or pets not identified by name, may be given a weight different than identified pets (such as indicated by “pets” in the above table).

In some examples, one or more other categories of objects (as indicated in the table above) that are identified may be assigned various weights (e.g., pets, cars, flowers, buildings, insects, trees/shrubs, artwork, furniture, or the like). In such examples, each category may be assigned a weight that may be applied to all pixel groups that include the object. In some examples, any pixel group that does not include an object may be assigned a different weight (such as indicated by “other” in the above table). In such examples, the different weight may be zero or some other non-zero number. In most examples, the different weight may be less than one or more of the other weights. In some examples, the different weight may vary depending on a distance away from an identified object.

In some examples, the weights, the priorities, and the object types of the table may be predefined. In some examples, the weights, the priorities, and the object types may adjust over time based upon learning from actions of a user. For example, if a user predominantly captures images that include a particular object, that object may be assigned a higher priority. For another example, the weights may change based upon one or more actions of a user in relation to a number of images of a similar scene. For example, based upon which images are deleted, the priority weights may be updated to the preferences of the user. Similar to deleting, images that are shared with others may indicate that the one or more settings for that image were optimal.

Referring back to FIG. 3, an area of the image may be identified that includes an object 322. Based upon what the object 322 is identified as, one or more pixel groups of the priority weight array 320 that include the object 322 may be assigned a first weight. In some examples, one or more pixel groups of the priority weight array 320 that are not included in the object 322 may be assigned a second weight. While the priority weight array 320 illustrates the object 322 having a weight of 1 and the one or more pixel groups that are not included in the object 322 having a weight of 0, it should be recognized that any weight may be assigned to the object 322 as well as the one or more pixel groups that are not included in the object 322.

FIG. 4 illustrates an example of a priority weight array 420 for multiple objects. Similar to above, a weight scale 410 has been provided to be a key for particular weights. In some examples, one or more objects may be identified in an image. In such examples, an AEC may determine to execute using a subset of the one or more objects identified in the image. For example, if more than one object is identified in the image, the AEC may determine to execute using only one of the objects. In some examples, the AEC may determine to execute based upon an object with the highest priority. In other examples, the AEC may determine to execute based upon one or more of the objects identified, but not all of the objects identified. In other examples, the AEC may determine to execute based upon all objects identified.

In one illustrative example, the one or more objects may include a first object 422, a second object 424, and a third object 426. In some examples, each object may be assigned a weight. In some examples, two or more weights may be similar and/or two or more weights may be different. For example, the first object 422 may be an active object (as described in the table above), the second object 424 may be a user-selected object (as described in the table above), and the third object 426 may be a car (as described in the table above).

FIG. 5 illustrates an example of a focus reticle weight array 530. Similar to above, a weight scale 510 has been provided to be a key for particular weights. In some examples, the focus reticle weight array 530 may identify a location of a focus reticle 522 in the image 520. In some examples, the focus reticle 522 may be adjusted based upon hand gestures, an automatic focus control (AF) (as described above), and/or resizing of the focus reticle 522. In some examples, a hand gesture may form or draw the focus reticle 522. In such examples, if an object is inside of the focus reticle 522, pixel groups of the object may be given a higher weight. If an object is on the boundary of the focus reticle 522, pixel groups of the object may be given a lesser weight. If an object is out of the focus reticle 522, pixel groups of the object may be given the lowest weight. In some examples, the focus reticle 522 may be resized using a command either on a content capture device or remote from the content capture device (e.g., a hand gesture, a remote device, or the like). In other examples, AF may indicate an area of the image 520 that is the focus of the image 520. In such examples, objects in the area may be given a higher weight.

In some examples, similar to the center metering weight array 230, weights of the focus reticle weight array 530 may decrease as distance increases from the center of the focus reticle 522. For example, the center of the focus reticle 522 may have the highest weight. Pixel groups around the center may also have the highest weight. But, as distance increases from the center, the pixel groups may decrease in amount of weight.

In other examples, the focus reticle weight array 530 may be utilized with an object identification system to increase the weight of pixel groups that include objects that are at least partially within the focus reticle 522. For example, if the object is overlapping or completely contained in the focus reticle 522, the weight of the pixel groups that include the object may be increased. If the object is completely outside of the focus reticle 522, the weight of the pixel groups that include the object may be decreased.

FIG. 6 illustrates an example of an eye gaze weight array 690. Similar to above, a weight scale 610 has been provided to be a key for particular weights. To implement the eye gaze weight array 690, a system may include one or more eye capture devices. An eye capture device may be used to capture one or more images and/or one or more videos of one or more eyes (e.g., eyes 660) of a user. The images and/or videos may be further processed to determine a gaze 670 of the eyes 660. In some examples, the gaze 670 may indicate a direction that the eyes 660 are looking or a location that the eyes 660 are looking at.

In some examples, the gaze 670 may also indicate a depth that the eyes 660 are looking at, based upon the eyes 660 of the user. In some examples, by looking at a left eye and a right eye, a system may determine that the gaze 670 is looking at a particular object at a particular depth. In some examples, similar to the center metering weight array 230, weights of the eye gaze weight array 690 may decrease as distance increases from a location that the gaze 670 is looking at. For example, the center of the location that the gaze 670 is looking at may have the highest weight. Pixel groups around the center may also have the highest weight. But, as distance increases from the center, the weight of the pixel groups may decrease.

To illustrate the eye gaze weight array 690, the gaze 670 may be pointing to a second set of pixel groups 630, which may include a second object 632. The pixel groups that include the second object 632 may be assigned the highest weight. Then, weights may be assigned based upon a distance away from the second object 632 and/or based upon other objects that are identified.

For example, a first object 622 in a first set of pixel groups 620 and a third object 642 in a third set of pixel groups 640 may be identified. In such an example, the first set of pixel groups 620 may be assigned a weight corresponding to the distance from the second object 632 and an identification of the first object 622. If the first object 622 is an object with a high priority, the weights of the first set of pixel groups may be increased. On the other hand, if the first object 622 is an object with a low priority, the weights of the first set of pixel groups may be decreased. A similar operation may be performed for the third set of pixel groups 640 and the third object 642. In some examples, no object may be identified in a set of pixel groups (e.g., the fourth set of pixel groups 650). When no object is identified, the set of pixel groups may be assigned a default value or a value based upon a distance from one or more of the objects (e.g., the second object 632) or a location of the gaze 670.

FIG. 16 is a flowchart illustrating an embodiment of a process 1600 for automatic exposure control using a location identified based upon a gaze of a user. In some aspects, the process 1600 may be performed by a computing device (e.g., a content capture device such as a camera).

The process 1600 is illustrated as a logical flow diagram, the operations of which represent a sequence of operations that may be implemented in hardware, computer instructions, or a combination thereof. In the context of computer instructions, the operations represent computer-executable instructions stored on one or more computer-readable storage media that, when executed by one or more processors, perform the recited operations. Generally, computer-executable instructions include routines, programs, objects, components, data structures, and the like that perform particular functions or implement particular data types. The order in which the operations are described is not intended to be construed as a limitation, and any number of the described operations may be combined in any order and/or in parallel to implement the processes.

Additionally, the process 1600 may be performed under the control of one or more computer systems configured with executable instructions and may be implemented as code (e.g., executable instructions, one or more computer programs, or one or more applications) executing collectively on one or more processors, by hardware, or combinations thereof. As noted above, the code may be stored on a machine-readable storage medium, for example, in the form of a computer program comprising a plurality of instructions executable by one or more processors. The machine-readable storage medium may be non-transitory.

The process 1600 may include receiving an image captured by a content capture device (1610). In some examples, the image may include a plurality of pixels. In some examples, the image may be received from a camera or other content capture device. In other examples, the image may be received from a feed (or stream) of images. In such examples, the feed may be current or past images. FIG. 6 illustrates the image as including the first set of patches 620, the second set of patches 630, the third set of patches 640, and the fourth set of patches 650. It should be recognized that the image may include more or fewer sets of patches than illustrated in FIG. 6.

The process 1600 may further include identifying a target luma value for the image (1620). In some examples, the target luma value may indicate an optimal amount of luma for an image. In such examples, each pixel of the image may be associated with a luma value. In some examples, the target luma value may correspond to an average of the luma values of the image. In other examples, the target luma value may correspond to a weighted average of the luma values of the image. In other examples, the target luma value may correspond to numbers that would result from multiplying a weight array with luma values.

In some examples, the target luma value may be predefined. In other examples, the target luma value may be determined based upon one or more sensors. For one illustrative example, a sensor may detect an amount of light in an environment. Based upon the amount of light, the target luma value may be set. In such an example, there might be thresholds for the target luma value. For example, if the amount of light is above a certain amount, the target luma value may be a certain amount.
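
A minimal sketch of such thresholding follows, assuming a lux reading from an ambient light sensor; the specific thresholds and target values are invented for illustration and are not from the disclosure.

```python
def target_luma_from_ambient(lux):
    """Pick a target luma from an ambient-light reading.

    The disclosure only says the target may depend on the sensed
    amount of light; these thresholds are placeholder values.
    """
    if lux > 10_000:   # bright, outdoor-like light
        return 140
    if lux > 500:      # typical indoor light
        return 118
    return 90          # dim environment
```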

The process 1600 may further include dividing the plurality of pixels of the image into a plurality of pixel groups (1630). In some examples, a pixel group may represent a contiguous (or adjacent/adjoining) group of pixels. In such examples, a shape of the pixel group may vary. For example, each pixel group may be a rectangle or square such that the image is divided into rows and columns. In other examples, each pixel group may include a certain number of pixels from a center of the image. In some examples, different pixel groups may be different shapes (e.g., if a pixel group is created such that it includes an object). In some examples, pixel groups may be of different sizes. In some examples, a pixel group may be arranged such that two or more pixel groups are overlapping.
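
For the common rows-and-columns case, one possible sketch (assuming the image dimensions are exact multiples of the group size; other layouts would need their own indexing) uses NumPy reshaping:

```python
import numpy as np

def divide_into_pixel_groups(luma, group_h, group_w):
    """Split a (H, W) array of luma values into rectangular pixel
    groups, returning shape (rows, cols, group_h, group_w)."""
    h, w = luma.shape
    assert h % group_h == 0 and w % group_w == 0
    return (luma.reshape(h // group_h, group_h, w // group_w, group_w)
                .swapaxes(1, 2))
```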

The process 1600 may further include calculating a pixel group luma value for each of the plurality of pixel groups (1640). In some examples, the pixel group luma value for a pixel group may include an average of luma values for each pixel of the pixel group. In other examples, the pixel group luma value may be a sum of the luma values for each pixel of a pixel group. In other examples, the pixel group luma value may be a difference between luma values of each pixel and an average luma value for the image. For example, an average luma value may be computed for the image, taking into account every pixel. The average luma value may then be used to compute a difference for each pixel such that a difference from the average luma value is generated for a pixel group. It should be recognized that other summary values of luma values of a pixel group may be used.
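
The summary values described here (mean, sum, or difference from the image average) can each be computed per group in a few lines. This sketch builds on the division above, with a synthetic array standing in for real luma data:

```python
import numpy as np

rng = np.random.default_rng(0)
luma = rng.integers(0, 256, size=(128, 128)).astype(float)  # stand-in image

gh, gw = 16, 16
groups = luma.reshape(128 // gh, gh, 128 // gw, gw).swapaxes(1, 2)

group_mean = groups.mean(axis=(2, 3))   # average luma per pixel group
group_sum = groups.sum(axis=(2, 3))     # summed luma per pixel group
group_diff = group_mean - luma.mean()   # difference from the image average
```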

The process 1600 may further include identifying a location in the image (1650). The location may correspond to a point where a user is looking in an environment (i.e., a physical area where the user is located) corresponding to the image. For example, the image may correspond to a general direction that a user is looking. The location may be the position in the image at which it is determined that the user is actually looking.

In some examples, the location may be identified based upon an eye image of one or more eyes of the user. The eye image may be separate from the image described above. In such examples, the eye image may be analyzed to identify where the user is looking. For example, each set of one or more pixels of the eye image may be determined to correspond to a different set of one or more pixels of the image. In such an example, if the one or more eyes in the eye image are determined to be looking at a particular set of one or more pixels of the eye image, the location in the image may be identified as a set of one or more pixels of the image that correspond to the particular set of one or more pixels of the eye image. It should be recognized that more than one eye image may be used. For example, an eye image of a first eye and an eye image of a second eye may be used. In some examples, the location may correspond to a location of an object identified in the image. It should also be recognized that other methods for identifying where a user is looking based upon an eye image of one or more eyes of the user may be used.

In other examples, the location may be identified based upon a direction of a gaze of the user. In such examples, instead of using an eye image of one or more eyes of the user, an orientation of a device worn by the user may be used. In one illustrative example, the orientation may include one or more from the set of pitch, yaw, and roll. In such an example, a sensor (e.g., a gyroscope) that measures the orientation of the device may be included with the device. It should be recognized that other methods for identifying where a user is looking based upon an orientation of a device may be used.

The process 1600 may further include setting weights for each of the plurality of pixel groups based upon the identified location (1660). For example, pixel groups that correspond to locations that are closer to the identified location may be given higher weights than pixel groups that correspond to locations that are farther from the identified location. It should be recognized that weights may be set according to other methods. FIG. 6 illustrates weights set for pixel groups in the eye gaze weight array 690.

In some examples, the process 1600 may further include dividing the plurality of pixels of the image into a plurality of patches, where a patch includes one or more pixel groups. In such examples, setting weights may be further based upon a distance from a patch that includes the location. FIG. 6 illustrates dividing the plurality of pixels of the image into a plurality of patches. For example, a first patch may be the first set of patches 620, a second patch may be the second set of patches 630, a third patch may be the third set of patches 640, and a fourth patch may be the fourth set of patches 650. It should be recognized that the image may be divided into more or fewer sets of patches than illustrated in FIG. 6.

The process 1600 may further include calculating an image luma value using, for each of the plurality of pixel groups, the weight and the pixel group luma value (1670). In some examples, calculating the image luma value may include summing each of the weighted pixel group luma values, where a weighted pixel group luma value may be computed by multiplying a weight associated with a pixel group with a luma value associated with the pixel group.
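
The weighted luma average equation of FIG. 1 is not reproduced in this excerpt, but a sketch consistent with this description (and with the later note that the weighted luma average divides by the summation of the total weight array) might look as follows; the function name is illustrative.

```python
import numpy as np

def weighted_image_luma(weights, group_luma):
    """Sum of weight x group luma, divided by the summed weights so
    the result remains comparable to the target luma value."""
    return float((weights * group_luma).sum() / weights.sum())
```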

The process 1600 may further include computing a difference between the image luma value and the target luma value (1680), and updating a setting of the content capture device based upon the computed difference (1690). In some examples, the setting may be an exposure setting or a gain setting. In such examples, the exposure setting may be a shutter speed, an ISO speed, or any combination thereof. In some examples, the shutter speed may be a global shutter or a rolling shutter. The global shutter may indicate a duration of time to expose all pixels of a field of view. The rolling shutter may indicate a duration of time to expose a row (either horizontally or vertically) of a field of view. In the rolling shutter, lines of an image may be scanned in a rolling manner rather than a snapshot of the field of view. The gain setting may be digital gain, analog gain, or any combination thereof. For example, a gain setting may be 8× by having an analog gain of 2× and a digital gain of 4×. In some examples, the exposure setting may be adjusted before the gain setting when increasing exposure. In such examples, the gain setting may be adjusted before the exposure setting when decreasing exposure. In some examples, the amount of adjustment may be proportional to the difference. For example, the adjustment may be larger if the difference between the image luma value and the target luma value is larger.
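
One possible control loop honoring the ordering described here (exposure before gain when brightening, gain before exposure when darkening) with a step proportional to the difference is sketched below. The limits, the step constant, and the scalar representation of exposure and gain are assumptions for illustration, not values from the disclosure.

```python
MAX_EXPOSURE = 1 / 30     # seconds; illustrative shutter limits
MIN_EXPOSURE = 1 / 8000
MAX_GAIN = 16.0           # combined analog x digital gain

def update_setting(image_luma, target_luma, exposure, gain, k=0.005):
    """Adjust exposure/gain proportionally to the luma difference."""
    error = target_luma - image_luma
    step = k * abs(error)
    if error > 0:                                   # too dark
        if exposure < MAX_EXPOSURE:                 # exposure first
            exposure = min(MAX_EXPOSURE, exposure * (1 + step))
        else:
            gain = min(MAX_GAIN, gain * (1 + step))
    elif error < 0:                                 # too bright
        if gain > 1.0:                              # gain first
            gain = max(1.0, gain * (1 - step))
        else:
            exposure = max(MIN_EXPOSURE, exposure * (1 - step))
    return exposure, gain
```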

FIG. 7 illustrates an example of a normalized total weight array 730 for an image 720. Similar to above, a weight scale 710 has been provided to be a key for particular weights. In some examples, the normalized total weight array 730 may be a combination of two or more weight arrays. In such examples, the two or more weight arrays may be generated based upon any method, including the methods described above (e.g., metering, gaze, focus reticle, and the like).

In some examples, the normalized total weight array 730 may be based upon the image 720. In such examples, one or more objects (e.g., woman 722, child 724, cat 726, toy house 728, and toy truck 729) may be identified in the image 720. Based upon identifying the one or more objects, one or more weight arrays may be generated.

For example, an object priority weight array 740 (similar to the priority weight array 420) may be generated. The object priority weight array 740 may assign weights to pixel groups that include the one or more objects. In some examples, the weights may be based upon a table as described above with FIG. 3. For example, people may be associated with a higher weight than animals. Accordingly, pixel groups including the woman 722 and the child 724 may be given a weight higher than the pixel groups including the cat 726. In addition, toys may be associated with a lower weight than animals, and thus, pixel groups including the toy house 728 and the toy truck 729 may have a lower weight than the pixel groups including the cat 726. Other pixel groups that do not include an identified object may have a lower weight than the toys. For example, the illustration in FIG. 7 shows the other pixel groups to have a weight of 0; however, the weight could be non-zero.

For another example, an object distance weight array 750 may be generated using the one or more identified objects. In such an example, the object distance weight array 750 may further be based upon a depth map. The depth map may be created from an image (or a set of images) and describe a depth of each pixel. Using the depth map, an identified object may be associated with a depth. Then, objects closer to an image source (e.g., a content capture device that takes the image 720) may be assigned a higher weight.

For example, pixel groups of the toy house 728 may be given the highest weight because the toy house is the closest object. Pixel groups of the woman 722, the child 724, and the cat 726 may be assigned a similar weight that is less than the weight for the toy house 728 because these objects are about the same distance away from the image source but are farther away than the toy house 728. Similarly, pixel groups of the toy truck 729 may be assigned the smallest weight of the identified objects because the toy truck 729 is the farthest object away. In some examples, other pixel groups that do not include an object may be either assigned a weight of zero or some other non-zero weight that is less than weights assigned to pixel groups of objects.

In other examples, each pixel group may be assigned a weight regardless of whether an object is identified. In such examples, objects do not need to be identified because each weight may correspond to a value in the depth map. It should be recognized that other methods for weighing objects based upon distance may be used.
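
A sketch of this per-pixel-group variant follows, mapping a depth map directly to weights with an inverse-linear ramp; the near/far bounds are illustrative assumptions, and any monotonically decreasing mapping would fit the description.

```python
import numpy as np

def distance_weights(depth_map, near=0.5, far=10.0):
    """Weights from per-pixel-group depths (meters): 1.0 at `near`
    or closer, falling linearly to 0.0 at `far` or beyond."""
    d = np.clip(depth_map, near, far)
    return (far - d) / (far - near)
```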

For another example, an object size weight array 760 may be generated using the one or more identified objects. While a size of an object may be computed in several different ways, one way to determine a size of an object is to count the number of pixel groups that were identified in the object size weight array 760 for the object.

In some examples, after computing the size of each identified object, the pixel groups of each identified object may be assigned a weight that is proportional to its size. For example, the toy house 728 may be given the largest weight because it is the biggest object. In one illustrative example, the woman 722, the child 724, and the toy truck 729 may all be given the same weight because they are all similar sizes. And the cat 726 may be given the smallest weight because it is the smallest. Other pixel groups not including an identified object may either be given a zero weight or a non-zero weight that is smaller than the smallest weight for an identified object.

In other examples, after computing the size of each identified object, the pixel groups of each identified object may be given a percentage of weight for each pixel group that is identified for the identified object. For example, if each pixel group is worth 0.1 weight, an object with 4 pixel groups would be 0.4 weight. In other examples, the amount of weight each pixel group is worth may be a percentage of the largest object. For example, the largest object may be given a weight of 1. And if that object is 10 pixel groups large, an object with 5 pixel groups may be given 0.5 weight. It should be recognized that other methods for weighing objects based upon size may be used.
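
A sketch of the percentage-of-largest-object variant, assuming each identified object is given as a boolean mask over the pixel-group grid (the mask representation is an assumption for illustration):

```python
import numpy as np

def size_weight_array(object_masks):
    """Weights proportional to object size in pixel groups, scaled so
    the largest object receives weight 1 (per the example above)."""
    sizes = np.array([m.sum() for m in object_masks], dtype=float)
    scaled = sizes / sizes.max()
    array = np.zeros(object_masks[0].shape)
    for mask, w in zip(object_masks, scaled):
        array[mask] = w
    return array
```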

For another example, one or more metering weight arrays may be generated, as described in FIG. 2. In one illustrative example, a center metering weight array 770 (similar to the center metering weight array 230) may be generated.

For another example, an eye gaze weight array 790 (similar to the eye gaze weight array 690) may be generated.

In some examples, the image 720 may also include information associated with a focus reticle as described with FIG. 5. The focus reticle, alone or in combination with the identified objects, may be used to generate a focus reticle weight array 780 (similar to the focus reticle weight array 530).

As described above, after the two or more weight arrays are generated, the two or more weight arrays may be combined to create the normalized total weight array 730. For example, weights for a pixel group across a plurality of weight arrays may be combined (e.g., multiplied together) to generate an updated weight for the pixel group. The normalized total weight array 730 may be combined with the average luma pixel group values to compute weighted luma pixel groups, as described above with FIG. 1. In some examples, the normalized total weight array 730 may be expressed as:

$w_{T}[r,c] = \prod_{i=0}^{N_{w}-1} w_{i}[r,c]$

where $N_{w}$ is the number of weight arrays. In some examples, a normalization of the total weight array may be performed such that the total weight array is scaled according to a largest weight having a value of 1 and a smallest weight having a minimum of $w_{T}[r,c]$ (e.g., 0). In some examples, normalization may be skipped because the weighted luma average automatically scales results by dividing by the summation of the total weight array.
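
In code, the product and the optional rescaling might look like the following sketch; dividing by the maximum so the largest weight becomes 1 is one reading of the scaling described above, and the function name is illustrative.

```python
import numpy as np

def total_weight_array(weight_arrays, normalize=True):
    """Elementwise product of per-category weight arrays,
    w_T[r, c] = prod_i w_i[r, c], optionally rescaled to a max of 1."""
    w_total = np.ones_like(weight_arrays[0])
    for w in weight_arrays:
        w_total = w_total * w
    if normalize and w_total.max() > 0:
        w_total = w_total / w_total.max()
    return w_total
```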

FIG. 8 is a flowchart illustrating an embodiment of a process 800 for automatic exposure control using a first weighting model. In some aspects, the process 800 may be performed by a computing device.

Process 800 is illustrated as a logical flow diagram, the operations of which represent a sequence of operations that may be implemented in hardware, computer instructions, or a combination thereof. In the context of computer instructions, the operations represent computer-executable instructions stored on one or more computer-readable storage media that, when executed by one or more processors, perform the recited operations. Generally, computer-executable instructions include routines, programs, objects, components, data structures, and the like that perform particular functions or implement particular data types. The order in which the operations are described is not intended to be construed as a limitation, and any number of the described operations may be combined in any order and/or in parallel to implement the processes.

Additionally, the process 800 may be performed under the control of one or more computer systems configured with executable instructions and may be implemented as code (e.g., executable instructions, one or more computer programs, or one or more applications) executing collectively on one or more processors, by hardware, or combinations thereof. As noted above, the code may be stored on a machine-readable storage medium, for example, in the form of a computer program comprising a plurality of instructions executable by one or more processors. The machine-readable storage medium may be non-transitory.

The process 800 may include receiving an image captured by a content capture device (805). In some examples, the image may include a plurality of pixels. In some examples, the image may be received from a camera or other content capture device. In other examples, the image may be received from a feed (or stream) of images. In such examples, the feed may be current or past images.

The process 800 may further include identifying a target luma value for the image (810). In some examples, the target luma value may indicate an optimal amount of luma for an image. In such examples, each pixel of the image may be associated with a luma value. In some examples, the target luma value may correspond to an average of the luma values of the image. In other examples, the target luma value may correspond to a weighted average of the luma values of the image. In other examples, the target luma value may correspond to numbers that would result from multiplying a weight array with luma values.

In some examples, the target luma value may be predefined. In other examples, the target luma value may be determined based upon one or more sensors. For one illustrative example, a sensor may detect an amount of light in an environment. Based upon the amount of light, the target luma value may be set. In such an example, there might be thresholds for the target luma value. For example, if the amount of light is above a certain amount, the target luma value may be a certain amount.

The process 800 may further include identifying an object in the image (815). In some examples, the object may be identified using an object identification system. In such examples, the object identification system may identify a type of the object (e.g., a person, an animal, a building, or the like). In other examples, the object identification system may identify one or more attributes of the object that indicate more than just a type of the object (e.g., that the object is Bill Nye). In some examples, the object identification system may identify whether the object is being processed by another system (such as a mixed reality system that is determining where to place a virtual object).

The process 800 may further include dividing the plurality of pixels of the image into a plurality of pixel groups (820). In some examples, a pixel group may represent a contiguous (or adjacent/adjoining) group of pixels. In such examples, a shape of the pixel group may vary. For example, each pixel group may be a rectangle or square such that the image is divided into rows and columns. In other examples, each pixel group may include a certain number of pixels from a center of the image. In some examples, different pixel groups may be different shapes (e.g., if a pixel group is created such that it includes an object). In some examples, pixel groups may be of different sizes. In some examples, a pixel group may be arranged such that two or more pixel groups are overlapping.

The process 800 may further include calculating a pixel group luma value for each of the plurality of pixel groups (825). In some examples, the pixel group luma value for a pixel group may include an average of luma values for each pixel of the pixel group. In other examples, the pixel group luma value may be a sum of the luma values for each pixel of a pixel group. In other examples, the pixel group luma value may be a difference between luma values of each pixel and an average luma value for the image. For example, an average luma value may be computed for the image, taking into account every pixel. The average luma value may then be used to compute a difference for each pixel such that a difference from the average luma value is generated for a pixel group. It should be recognized that other summary values of luma values of a pixel group may be used.

The process 800 may further include defining a first set of pixel groups not associated with the object (830). In some examples, the first set of pixel groups may include one or more other objects and/or no objects. In some examples, not being associated with the object may indicate that a pixel of one or more pixels that have been identified as the object is not included in the first set of pixel groups.

The process 800 may further include setting weights for each of the pixel groups in the first set of pixel groups (835). In some examples, the weights for each of the pixel groups in the first set of pixel groups may be based upon a table as described above with respect to FIG. 3. In other examples, the weights of each of the pixel groups in the first set of pixel groups may be automatically set to zero because the object is not included in the pixel groups. It should be recognized that other ways to determine a weight of a pixel group without the object may be used.

The process 800 may further include defining a second set of pixel groups associated with the object (840). In some examples, being associated with the object may indicate that a pixel of one or more pixels that have been identified as the object is included in the second set of pixel groups.

The process 800 may further include setting weights for each of the pixel groups in the second set of pixel groups (845). Similar to the description above, the weights for each of the pixel groups in the second set of pixel groups may be based upon a table as described with respect to FIG. 3. In other examples, the weights of each of the pixel groups in the second set of pixel groups may be automatically set to 1 because the object is included in the pixel groups. It should be recognized that other ways to determine a weight of a pixel group with the object may be used.
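
In the simplest constant-weight reading, blocks 830 through 845 reduce to a binary mask. This sketch assumes the object is given as a boolean mask over pixel groups; a priority table could be substituted for either constant.

```python
import numpy as np

def first_model_weights(object_mask, inside=1.0, outside=0.0):
    """Weight 1 for pixel groups containing the object (block 845)
    and weight 0 for all other pixel groups (block 835)."""
    return np.where(object_mask, inside, outside)
```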

The process 800 may further include calculating an image luma value using, for each of the plurality of pixel groups, the weight and the pixel group luma value (850). In some examples, calculating the image luma value may include summing each of the weighted pixel group luma values, where a weighted pixel group luma value may be computed by multiplying a weight associated with a pixel group with a luma value associated with the pixel group.

The process 800 may further include computing a difference between the image luma value and the target luma value (855), and updating a setting of the content capture device based upon the computed difference (860). In some examples, the setting may be an exposure setting or a gain setting. In such examples, the exposure setting may be a shutter speed, an ISO speed, or any combination thereof. In some examples, the shutter speed may be a global shutter or a rolling shutter. The global shutter may indicate a duration of time to expose all pixels of a field of view. The rolling shutter may indicate a duration of time to expose a row (either horizontally or vertically) of a field of view. In the rolling shutter, lines of an image may be scanned in a rolling manner rather than a snapshot of the field of view. The gain setting may be digital gain, analog gain, or any combination thereof. For example, a gain setting may be 8× by having an analog gain of 2× and a digital gain of 4×. In some examples, the exposure setting may be adjusted before the gain setting when increasing exposure. In such examples, the gain setting may be adjusted before the exposure setting when decreasing exposure. In some examples, the amount of adjustment may be proportional to the difference. For example, the adjustment may be larger if the difference between the image luma value and the target luma value is larger.

Using the weighted luma average equation described above with regard to FIG. 1, a specific weight may be disabled for the first weighting model by setting an array associated with the weight to unity.

FIG. 17 is a flowchart illustrating an embodiment of a process 1700 for automatic exposure control using a depth map. The depth map may include one or more distances, where a distance is measured from a single point for each of the one or more distances. Each distance may correspond to one or more pixels in an image such that each distance indicates a distance that content included in the one or more pixels is from the single point.

In some aspects, the process 1700 may be performed by a computing device (e.g., a content capture device such as a camera). The process 1700 is illustrated as a logical flow diagram, the operations of which represent a sequence of operations that may be implemented in hardware, computer instructions, or a combination thereof. In the context of computer instructions, the operations represent computer-executable instructions stored on one or more computer-readable storage media that, when executed by one or more processors, perform the recited operations. Generally, computer-executable instructions include routines, programs, objects, components, data structures, and the like that perform particular functions or implement particular data types. The order in which the operations are described is not intended to be construed as a limitation, and any number of the described operations may be combined in any order and/or in parallel to implement the processes.

Additionally, the process 1700 may be performed under the control of one or more computer systems configured with executable instructions and may be implemented as code (e.g., executable instructions, one or more computer programs, or one or more applications) executing collectively on one or more processors, by hardware, or combinations thereof. As noted above, the code may be stored on a machine-readable storage medium, for example, in the form of a computer program comprising a plurality of instructions executable by one or more processors. The machine-readable storage medium may be non-transitory.

The process 1700 may include receiving an image captured by a content capture device (1710). In some examples, the image may include a plurality of pixels. In some examples, the image may be received from a camera or other content capture device. In other examples, the image may be received from a feed (or stream) of images. In such examples, the feed may be current or past images.

The process 1700 may further include identifying a target luma value for the image (1720). In some examples, the target luma value may indicate an optimal amount of luma for an image. In such examples, each pixel of the image may be associated with a luma value. In some examples, the target luma value may correspond to an average of the luma values of the image. In other examples, the target luma value may correspond to a weighted average of the luma values of the image. In other examples, the target luma value may correspond to numbers that would result from multiplying a weight array with luma values.

In some examples, the target luma value may be predefined. In other examples, the target luma value may be determined based upon one or more sensors. For one illustrative example, a sensor may detect an amount of light in an environment. Based upon the amount of light, the target luma value may be set. In such an example, there might be thresholds for the target luma value. For example, if the amount of light is above a certain amount, the target luma value may be a certain amount.

The process 1700 may further include dividing the plurality of pixels of the image into a plurality of pixel groups (1730). In some examples, a pixel group may represent a contiguous (or adjacent/adjoining) group of pixels. In such examples, a shape of the pixel group may vary. For example, each pixel group may be a rectangle or square such that the image is divided into rows and columns. In other examples, each pixel group may include a certain number of pixels from a center of the image. In some examples, different pixel groups may be different shapes (e.g., if a pixel group is created such that it includes an object). In some examples, pixel groups may be of different sizes. In some examples, a pixel group may be arranged such that two or more pixel groups are overlapping.

The process 1700 may further include calculating a pixel group luma value for each of the plurality of pixel groups (1740). In some examples, the pixel group luma value for a pixel group may include an average of luma values for each pixel of the pixel group. In other examples, the pixel group luma value may be a sum of the luma values for each pixel of a pixel group. In other examples, the pixel group luma value may be a difference between luma values of each pixel and an average luma value for the image. For example, an average luma value may be computed for the image, taking into account every pixel. The average luma value may then be used to compute a difference for each pixel such that a difference from the average luma value is generated for a pixel group. It should be recognized that other summary values of luma values of a pixel group may be used.

The process 1700 may further include receiving a depth map corresponding to the image (1750). As described above, the depth map may include one or more distances, where a distance is measured from a single point for each of the one or more distances. Each distance may correspond to one or more pixels in an image such that each distance indicates a distance that content included in the one or more pixels is from the single point.

In some examples, the depth map may be captured concurrently with the image capture described above. For example, whenever an image is captured for use with determining a setting for the content capture device, a depth map may also be captured. In other examples, the depth map may be captured at some point before the image is captured such that the depth map may be used to set weights for multiple images. In such examples, depth maps may be used based upon an orientation at which the image was captured. For example, each of one or more depth maps may correspond to a different orientation. When an image is taken at a particular orientation, a depth map (or a portion of a depth map) corresponding to the particular orientation may be used when setting weights to be used in determining a setting for the content capture device.

Referring to FIG. 7, the depth map may indicate a distance that each of the woman 722, the child 724, the cat 726, the toy house 728, and the toy truck 729 are from a content capture device. It should be recognized that the distances might not be object based but pixel based such that a distance is for a pixel rather than an object. It should also be recognized that the depth map may indicate depths in different ways than described above.

The process 1700 may further include setting weights for each of the plurality of pixel groups based upon the depth map (1760). The weights may be set such that closer sets of one or more pixels are weighted higher than sets of one or more pixels that are farther away. In some examples, setting weights may be further based upon data indicating a location of an object from the image. For example, as illustrated in FIG. 7, the weights in the object distance weight array 750 are set such that only pixels that are identified as associated with an object are given a weight. It should be recognized that other pixels may be set to different weights that are zero or more. In some examples, the data indicating the location of the object is determined by analyzing pixels of the image to identify one or more pixels of the image that match one or more pixels of a stored image of the object. It should also be recognized that a distance weight array may result from setting weights without being based upon objects identified in an image. It should also be recognized that methods other than those described above for setting weights using a depth map may be used.

The process 1700 may further include calculating an image luma value using, for each of the plurality of pixel groups, the weight and the pixel group luma value (1770). In some examples, calculating the image luma value may include summing each of the weighted pixel group luma values, where a weighted pixel group luma value may be computed by multiplying a weight associated with a pixel group with a luma value associated with the pixel group.

The process 1700 may further include computing a difference between the image luma value and the target luma value (1780), and updating a setting of the content capture device based upon the computed difference (1790). In some examples, the setting may be an exposure setting or a gain setting. In such examples, the exposure setting may be a shutter speed, an ISO speed, or any combination thereof. In some examples, the shutter speed may be a global shutter or a rolling shutter. The global shutter may indicate a duration of time to expose all pixels of a field of view. The rolling shutter may indicate a duration of time to expose a row (either horizontally or vertically) of a field of view. In the rolling shutter, lines of an image may be scanned in a rolling manner rather than a snapshot of the field of view. The gain setting may be digital gain, analog gain, or any combination thereof. For example, a gain setting may be 8× by having an analog gain of 2× and a digital gain of 4×. In some examples, the exposure setting may be adjusted before the gain setting when increasing exposure. In such examples, the gain setting may be adjusted before the exposure setting when decreasing exposure. In some examples, the amount of adjustment may be proportional to the difference. For example, the adjustment may be larger if the difference between the image luma value and the target luma value is larger.

FIG. 9 illustrates an example of a first part of a second weighting model that is used for automatic exposure control (AEC). In some examples, the second weighting model may treat each object of an image independently of everything else in the image. In such examples, a learning based model (e.g., a neural network, clustering, or the like) may be used for each object of the image (or a subset of the objects). The learning based model may output a value to be used as a weight for each pixel group of the object. After a weight is generated for each object of the image, a final weight array may be created by summing each one of the weight arrays for each object. This is different from the first weighting model, in which the weight arrays were multiplied together. In other examples, a learning based model may be used for each pixel group of an object. In such examples, the rest of the process would be performed similarly to having a learning based model for each object, except rather than having a single value used for every pixel group of an object, a value would be computed for each pixel group. In other examples, a learning based model may be used for each weight (e.g., object distance, object priority, metering, or the like) for an object. In such examples, the weights for a single object would need to be combined as discussed above for the first weighting model. After the weights for each object are combined, the rest of the process may continue as the second weighting model would.

In one illustrative example, the second weighting model may utilize one or more neural networks (e.g., a multilayer perceptron) to determine a weight array. In some examples, the second weighting model may create a neural network for each of one or more identified objects in an image. In such examples, inputs to a neural network may include weights similar to those described above for weight arrays. In one illustrative example, the weights may include: Object Priority—{0.1 . . . 1}; Object Size—Small, Medium, or Large {0.5, 0.75, 1}; Object Distance—Near, Mid, or Far {0.5, 0.75, 1}; Metering—Edge to Center {0.9 . . . 1}; Focus Reticle—In, Near, or Out {0.5, 0.75, 1}; and Eye Gaze—"At the vector," "near the vector," or "out of the vector" {0.5, 0.75, 1}.

In the example above, the object priority weight may indicate a priority of an object. The priority of the object may be between 0 and 1. In some examples, the priority may be defined by a table as described above with respect to FIG. 3.

The object size weight may indicate a size of an object. The size of the object may be one of an enumerated set (e.g., small, medium, or large). In some examples, a size that corresponds to each one of the enumerated set may be predefined (e.g., if an object is included in three or fewer pixel groups, the object is small and receives a weight of 0.5; if an object is included in four to six pixel groups, the object is medium and receives a weight of 0.75; and if an object is included in more than six pixel groups, the object is large and receives a weight of 1). In other examples, the size of the object may merely be the number of pixel groups that the object is included in.

The object distance weight may indicate a distance from an image source (e.g., a content capture device). The distance of the object may be one of an enumerated set (e.g., near, mid, or far). In some examples, a distance that corresponds to each one of the enumerated set may be predefined (e.g., if an object is within 10 feet of the image source, the object is near and receives a weight of 0.5; if an object is between 10 and 50 feet of the image source, the object is mid and receives a weight of 0.75; and if an object is more than 50 feet from the image source, the object is far and receives a weight of 1). In some examples, the distance may be computed using a depth map that was generated using a depth capturing device.

The metering weight may indicate a value associated with the object from one or more metering techniques as described above. For example, if an object is near an edge of a field of view of an image, a weight value of 0.9 may be assigned. If the object is at a center of the field of view, a weight value of 1.0 may be assigned.

The focus reticle weight may indicate where the object is relative to a focus reticle. A value for the focus reticle may be one of an enumerated set (e.g., in, near, or out). In some examples, the values for the enumerated set may be predefined (e.g., if an object is inside of the focus reticle, the object is in and receives a weight of 0.5; if an object overlaps the focus reticle, the object is near and receives a weight of 0.75; and if an object is outside of the focus reticle, the object is out and receives a weight of 1).

The eye gaze weight may indicate where a user is looking relative to the object. A value for the eye gaze may be one of an enumerated set (e.g., "at the vector," "near the vector," or "out of the vector"). In some examples, the values for the enumerated set may be predefined (e.g., if the eye gaze is toward an object, the eye gaze is "at the vector" and receives a weight of 1; if the eye gaze is a first distance away from an object, the eye gaze is "near the vector" and receives a weight of 0.75; and if the eye gaze is a second distance (greater than the first distance) from an object, the eye gaze is "out of the vector" and receives a weight of 0.5).
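
Taken together, these enumerated attributes can be packed into a feature vector per object before being fed to that object's network. The dictionary keys and example values below are illustrative encodings of the ranges listed above, not an interface from the disclosure.

```python
# Hypothetical encodings of the enumerated inputs described above.
SIZE = {"small": 0.5, "medium": 0.75, "large": 1.0}
DISTANCE = {"near": 0.5, "mid": 0.75, "far": 1.0}
RETICLE = {"in": 0.5, "near": 0.75, "out": 1.0}
GAZE = {"at": 1.0, "near": 0.75, "out": 0.5}

def encode_object(priority, size, distance, metering, reticle, gaze):
    """Pack one object's attributes into a neural-network input vector."""
    return [priority, SIZE[size], DISTANCE[distance],
            metering, RETICLE[reticle], GAZE[gaze]]

features = encode_object(0.8, "medium", "near", 0.95, "in", "at")
```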

Referring to FIG. 9, a first neural network for a first object (910) may be used. For the first neural network, each weight described above for the first object may be an input of inputs 920. For example, the object priority weight may be an exposure weight parameter 1 (922), the object size weight may be an exposure weight parameter 2 (924), and the eye gaze weight may be an exposure weight parameter n (926). It should be recognized that any number of weights may be used as the inputs 920.

The inputs 920 may then be passed to a hidden layer 930. While the hidden layer 930 is illustrated as one level, there may be more levels depending on the implementation of the neural network. For each level of the hidden layer 930, each value of a preceding level (which would be the inputs 920 for a first level) is multiplied by a value determined by the neural network for each node (e.g., node 932, node 934, node 936, and node 938) of the level. At each node, a function is applied to the input of the node. After the hidden layer 930 is complete, a final value is multiplied by each output, and the results are combined to create an object 1 exposure weight (922).
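
A minimal forward pass matching this description is sketched below, with random stand-in parameters where a trained network would supply learned ones; the layer sizes and the tanh node function are assumptions.

```python
import numpy as np

def mlp_forward(x, layers):
    """Multiply each level's values by learned weights, apply a node
    function, then combine the last hidden outputs into one value."""
    h = np.asarray(x, dtype=float)
    for W, b in layers[:-1]:
        h = np.tanh(W @ h + b)          # function applied at each node
    W_out, b_out = layers[-1]
    return (W_out @ h + b_out).item()   # single object exposure weight

# Example: 6 inputs -> 4 hidden nodes -> 1 output.
rng = np.random.default_rng(0)
layers = [(rng.normal(size=(4, 6)), rng.normal(size=4)),
          (rng.normal(size=(1, 4)), rng.normal(size=1))]
weight = mlp_forward([0.8, 0.75, 0.5, 0.95, 0.5, 1.0], layers)
```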

As may be seen in FIG. 9, a similar process as described above may be performed for each object in the image (e.g., object 2 neural network (950) and object M neural network (960)).

FIG. 10 illustrates an example of a second part of the second weighting model that may be used for automatic exposure control (AEC). The second part may illustrate what occurs after one or more neural networks are performed for each object in the image. Each of the neural networks (e.g., object 1 neural network 1030, object 2 neural network 1040, object 3 neural network 1050, object 4 neural network 1060, and object 5 neural network 1070) may output a single value (as described above). The single value may be a weight for an object associated with a neural network.

The single value may be applied to all of the pixel groups of the object (as shown in the total weight array 1080). To illustrate in an equation, a total weight array, $w_{T}$, may be a summation of the outputs from each object's neural network, for example:

$w_{T}[r,c] = \sum_{i=0}^{N_{o}-1} w_{i}[r,c]$

where $N_{o}$ is the number of objects.
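
As a sketch, assuming each object's network output has already been broadcast over that object's pixel groups into its own array (the list-of-arrays structure is an assumption for illustration):

```python
import numpy as np

def second_model_total(object_weight_arrays):
    """Per-pixel-group sum of each object's weight array, per the
    equation above; arrays share the shape of the pixel-group grid."""
    return np.sum(object_weight_arrays, axis=0)
```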

After the total weight array 1080 is generated, the total weight array 1080 may be compared to a target as described above with reference to FIG. 1. In addition, using the weighted luma average equation described above with regard to FIG. 1, a specific weight may be disabled for the second weighting model by setting an array associated with the weight to zero.

FIG. 11 is a flowchart illustrating an embodiment of a process 1100 for automatic exposure control (AEC) using a second weighting model. In some aspects, the process 1100 may be performed by a computing device.

Process 1100 is illustrated as a logical flow diagram, the operations of which represent a sequence of operations that may be implemented in hardware, computer instructions, or a combination thereof. In the context of computer instructions, the operations represent computer-executable instructions stored on one or more computer-readable storage media that, when executed by one or more processors, perform the recited operations. Generally, computer-executable instructions include routines, programs, objects, components, data structures, and the like that perform particular functions or implement particular data types. The order in which the operations are described is not intended to be construed as a limitation, and any number of the described operations may be combined in any order and/or in parallel to implement the processes.

Additionally, the process 1100 may be performed under the control of one or more computer systems configured with executable instructions and may be implemented as code (e.g., executable instructions, one or more computer programs, or one or more applications) executing collectively on one or more processors, by hardware, or combinations thereof. As noted above, the code may be stored on a machine-readable storage medium, for example, in the form of a computer program comprising a plurality of instructions executable by one or more processors. The machine-readable storage medium may be non-transitory.

The process 1100 may include receiving an image captured by a content capture device (1105). In some examples, the image may include a plurality of pixels. In some examples, the image may be received from a camera or other content capture device. In other examples, the image may be received from a feed (or stream) of images. In such examples, the feed may be current or past images.

The process 1100 may further include identifying a target luma value for the image (1110). In some examples, the target luma value may indicate an optimal amount of luma for an image. In such examples, each pixel of the image may be associated with a luma value. In some examples, the target luma value may correspond to an average of the luma values of the image. In other examples, the target luma value may correspond to a weighted average of the luma values of the image. In some examples, the target luma value may be predefined. In other examples, the target luma value may be determined based upon one or more sensors. For one illustrative example, a sensor may detect an amount of light in an environment.

The process 1100 may further include identifying an object in the image (1115). In some examples, the object may be identified using an object identification system. In such examples, the object identification system may identify a type of the object (e.g., a person, an animal, a building, or the like). In other examples, the object identification system may identify one or more attributes or characteristics of the object, more than just a type of the object (e.g., that the object is Shania Twain). In some examples, the object identification system may identify whether the object is being processed by another system (such as a mixed reality system that is determining where to place a virtual object).

The process 1100 may further include identifying one or more attributes of the object (1120). In some examples, an attribute may be associated with a weight array, such as a weight array for object priority, object size, object distance, metering, focus reticle, eye gaze, or the like (as described with reference to FIG. 9). In such examples, the attribute may be a single value that represents an average of the weights of each of the pixel groups of the object.

The process 1100 may further include calculating a weight for the object using a neural network (1125). In some examples, the neural network may use the one or more attributes as input (as described with reference to FIG. 9). In such examples, the one or more attributes may be associated with a single object. An additional neural network may be used for one or more other objects (or each of the one or more other objects). In other examples, the neural network may be associated with a plurality of objects.

The process 1100 may further include dividing the plurality of pixels of the image into a plurality of pixel groups (1130). In some examples, a pixel group may represent a contiguous (or adjacent/adjoining) group of pixels. In such examples, a shape of the pixel group may vary. For example, each pixel group may be a rectangle or square such that the image is divided into rows and columns. In other examples, each pixel group may include a certain number of pixels from a center of the image. In some examples, different pixel groups may be different shapes (e.g., if a pixel group is created such that it includes an object). In some examples, pixel groups may be of different sizes.

The process 1100 may further include defining a first set of pixel groups not associated with the object (1135). In some examples, the first set of pixel groups may include one or more other objects and/or no objects. In some examples, not being associated with the object may indicate that a pixel of one or more pixels that have been identified as the object is not included in the first set of pixel groups.

The process 1100 may further include defining a second set of pixel groups associated with the object (1140). In some examples, being associated with the object may indicate that a pixel of one or more pixels that have been identified as the object is included in the second set of pixel groups.

The process 1100 may further include calculating a pixel group luma value for each pixel group of the second set of pixel groups (1145A). In some examples, the pixel group luma value for a pixel group may include an average of luma values for each pixel of the pixel group. In other examples, the pixel group luma value may be a sum of the luma values for each pixel of a pixel group. In other examples, the pixel group luma value may be a difference between luma values of each pixel and an average luma value for the image. For example, an average luma value may be computed for the frame, taking into account every pixel. The average luma value may then be used to compute a difference for each pixel such that a difference from the average luma value is generated for a pixel group. It should be recognized that other summary values of luma values of a pixel group may be used.

In addition, the process 1100 may further include multiplying the pixel group luma value by the weight to provide a weighted pixel group luma value for each pixel group of the second set of pixel groups (1145B). In some examples, the weight may be the same for each pixel group associated with the object.

The process 1100 may further include calculating a total luma value for the image (1150). In some examples, the total luma value may include a summation of the weighted pixel group luma values, where a weighted pixel group luma value may be computed by multiplying a weight associated with a pixel group with a luma value associated with the pixel group.

The process 1100 may further include computing a difference between the total luma value and the target luma value (1155), and updating a setting of the content capture device based upon the computed difference (1160). In some examples, the setting may be an exposure setting or a gain setting. In such examples, the exposure setting may be a shutter speed, an ISO speed, or any combination thereof. In some examples, the shutter speed may be a global shutter or a rolling shutter. The global shutter may indicate a duration of time to expose all pixels of a field of view. The rolling shutter may indicate a duration of time to expose a row (either horizontally or vertically) of a field of view. In the rolling shutter, lines of an image may be scanned in a rolling manner rather than a snapshot of the field of view. The gain setting may be digital gain, analog gain, or any combination thereof. For example, a gain setting may be 8× by having an analog gain of 2× and a digital gain of 4×. In some examples, the exposure setting may be adjusted before the gain setting when increasing exposure. In such examples, the gain setting may be adjusted before the exposure setting when decreasing exposure. In some examples, the amount of adjustment may be proportional to the difference. For example, the adjustment may be larger if the difference between the total luma value and the target luma value is larger.

FIG. 18 is a flowchart illustrating an embodiment of a process 1800 for automatic exposure control using multiple models. When using multiple models, two or more of the multiple models may be the same or different types of models. For example, multiple neural networks (as described above in FIGS. 9-11) may be used. When multiple models are used (according to this figure), each of the multiple models may provide weights for a particular portion of an image such that weights associated with each of the multiple models do not overlap.

In some aspects, the process 1800 may be performed by a computing device. The process 1800 is illustrated as a logical flow diagram, the operations of which represent a sequence of operations that may be implemented in hardware, computer instructions, or a combination thereof. In the context of computer instructions, the operations represent computer-executable instructions stored on one or more computer-readable storage media that, when executed by one or more processors, perform the recited operations. Generally, computer-executable instructions include routines, programs, objects, components, data structures, and the like that perform particular functions or implement particular data types. The order in which the operations are described is not intended to be construed as a limitation, and any number of the described operations may be combined in any order and/or in parallel to implement the processes.

Additionally, the process 1800 may be performed under the control of one or more computer systems configured with executable instructions and may be implemented as code (e.g., executable instructions, one or more computer programs, or one or more applications) executing collectively on one or more processors, by hardware, or combinations thereof. As noted above, the code may be stored on a machine-readable storage medium, for example, in the form of a computer program comprising a plurality of instructions executable by one or more processors. The machine-readable storage medium may be non-transitory.

The process 1800 may include receiving an image captured by a content capture device (1802). In some examples, the image may include a plurality of pixels. In some examples, the image may be received from a camera or other content capture device. In other examples, the image may be received from a feed (or stream) of images. In such examples, the feed may be images that are either recently taken or stored.

The process 1800 may further include identifying a target luma value for the image (1804). In some examples, the target luma value may indicate an optimal amount of luma for an image. In such examples, each pixel of the image may be associated with a luma value. In some examples, the target luma value may correspond to an average of the luma values of the image. In other examples, the target luma value may correspond to a weighted average of the luma values of the image. In some examples, the target luma value may be predefined. In other examples, the target luma value may be determined based upon one or more sensors. For one illustrative example, a sensor may detect an amount of light in an environment.

The process 1800 may further include dividing the plurality of pixels of the image into a plurality of pixel groups (1806). In some examples, a pixel group may represent a contiguous (or adjacent/adjoining) group of pixels. In such examples, a shape of the pixel group may vary. For example, each pixel group may be a rectangle or square such that the image is divided into rows and columns. In other examples, each pixel group may include a certain number of pixels from a center of the image. In some examples, different pixel groups may be different shapes (e.g., if a pixel group is created such that it includes an object). In some examples, pixel groups may be of different sizes.
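
As a minimal sketch of the rectangular case, assuming a single-channel luma image stored as a NumPy array (names illustrative), dividing into a rows-by-columns grid might look like:

    import numpy as np

    def divide_into_groups(luma_image, rows=8, cols=8):
        # Split row and column indices into near-equal runs, then take
        # the cross product of each pair of runs as one pixel group.
        h_idx = np.array_split(np.arange(luma_image.shape[0]), rows)
        w_idx = np.array_split(np.arange(luma_image.shape[1]), cols)
        return [luma_image[np.ix_(r, c)] for r in h_idx for c in w_idx]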

The process 1800 may further include identifying multiple patches in the image (1808).

The multiple patches may include a first patch and a second patch. Each patch may include one or more pixel groups. In some examples, some patches may include more or fewer pixel groups than other patches. In some examples, a patch may be identified based upon content of the image. For example, one or more objects may be identified in the image, where each patch includes pixels of a different object. In other examples, each patch may be a set of one or more pixels, each set being used no matter what image is received. It should be recognized that other methods for identifying patches may be used.

The process 1800 may further include calculating one or more weights for the first patch (1810). The one or more weights for the first patch may be calculated using a first model. The first model may be any model described herein, such as an object priority weight array, an object size weight array, an object distance weight array, a metering weight array, a focus reticle weight array, an eye gaze weight array, a neural network that is based upon content included in the first patch, or the like.

The process 1800 may further include calculating one or more weights for the second patch (1812). The one or more weights for the second patch may be calculated using a second model, which may be the same as or different from the first model. It should be recognized that more than two models may be used. It should also be recognized that certain patches may use the same model while other patches in the same image may use different models. In such cases, patches that are determined to have one or more particular characteristics in common may end up using the same model.
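
As a sketch of using per-patch models, assuming each model is a callable that maps a patch to weights for the patch's pixel groups (the dictionary-based interface is an assumption for illustration), calculating weights across patches might look like:

    def weights_for_patches(patches, models):
        # Apply one model per patch; patches judged to share
        # characteristics may be given the same callable.
        weights = {}
        for patch, model in zip(patches, models):
            for group_index, w in model(patch).items():
                # Patches cover disjoint pixel groups, so no group
                # should receive a weight from two models.
                assert group_index not in weights
                weights[group_index] = w
        return weights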

The process 1800 may further include calculating a pixel group luma value for each pixel group (1814). In some examples, the pixel group luma value for a pixel group may include an average of the luma values for each pixel of the pixel group. In other examples, the pixel group luma value may be a sum of the luma values for each pixel of a pixel group. In other examples, the pixel group luma value may be a difference between the luma values of each pixel and an average luma value for the image. For example, an average luma value may be computed for the frame, taking into account every pixel. The average luma value may then be used to compute a difference for each pixel such that a difference from the average luma value is generated for a pixel group. It should be recognized that other summary values of the luma values of a pixel group may be used.
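
As a sketch of these alternatives (the names and the mode flag are illustrative assumptions), the pixel group luma value might be computed as:

    import numpy as np

    def pixel_group_luma(group, mode="mean", frame_mean=None):
        # "mean": average luma of the group; "sum": total luma;
        # "diff": summed difference from the frame-wide average luma,
        # which must be supplied as frame_mean.
        group = np.asarray(group, dtype=float)
        if mode == "mean":
            return group.mean()
        if mode == "sum":
            return group.sum()
        if mode == "diff":
            return float(np.sum(group - frame_mean))
        raise ValueError("unknown mode: " + mode)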

In addition, the process 1800 may further include, for each pixel group, multiplying the pixel group luma value for the pixel group by the weight for the pixel group to provide a weighted pixel group luma value (1816). In some examples, the weight may be the same for each pixel group associated with the same object.

The process 1800 may further include calculating a total luma value for the image (1818). In some examples, the total luma value may include a summation of the weighted pixel group luma values. In some examples, calculating the total luma value may include summing each of the weighted pixel group luma values, where a weighted pixel group luma value may be computed by multiplying a weight associated with a pixel group with a luma value associated with the pixel group.

The process 1800 may further include computing a difference between the total luma value and the target luma value (1820), and updating a setting of the content capture device based upon the computed difference (1822). In some examples, the setting may be an exposure setting or a gain setting. In such examples, the exposure setting may be a shutter speed, an ISO speed, or any combination thereof. In some examples, the shutter speed may be a global shutter or a rolling shutter. The global shutter may indicate a duration of time to expose all pixels of a field of view. The rolling shutter may indicate a duration of time to expose a row (either horizontal or vertical) of a field of view. In the rolling shutter, lines of an image may be scanned in a rolling manner rather than as a snapshot of the field of view. The gain setting may be digital gain, analog gain, or any combination thereof. For example, a gain setting may be 8× by having an analog gain of 2× and a digital gain of 4×. In some examples, the exposure setting may be adjusted before the gain setting when increasing exposure. In such examples, the gain setting may be adjusted before the exposure setting when decreasing exposure. In some examples, the amount of adjustment may be proportional to the difference. For example, the adjustment may be larger if the difference between the total luma value and the target luma value is larger.

FIG. 12A illustrates an image stitching process 1200. In some examples, the image stitching process 1200 may include multiple instances of automatic exposure control (AEC), each instance adjusting the same or different one or more settings. And because AEC instances are used, any weighting model may be used, including the first and second weighting models discussed above. In some examples, the image stitching process 1200 may create an output image 1204 that combines two or more images. In such examples, the image stitching process 1200 may combine images that were captured based upon adjustments from each of the multiple instances of AEC. For example, after one or more settings are set due to an instance of AEC, an image may be captured and combined with other images that were also captured after one or more settings were set due to other instances of AEC.

In some examples, a content capture device 1210 may be configured in a high dynamic range (HDR) mode. The HDR mode may facilitate capturing multiple images of a similar scene and stitching the multiple images together. In such examples, the HDR mode may include an ability to have one or more settings changed for particular images. In a traditional HDR mode, software and/or hardware associated with a content capture device may be programmed to automatically adjust an exposure value (e.g., a number that represents a combination of a content capture device's shutter speed and f-number, sometimes abbreviated as EV) for a sequence of 3 or 5 frames by +/−1 EV, or by +/−1 EV and +/−3 EV. In some examples, an EV may be an exposure step of a predetermined size. For example, a 3-frame HDR mode may adjust the EV to −1 for a first frame, 0 for a second frame, and +1 for a third frame. In such examples, the traditional HDR mode may blindly adjust exposure by a fixed amount in an attempt to increase contrast in an image. The object-based exposure methods described herein are instead able to provide proper settings for a number of objects in the image. For example, a 3-frame HDR mode may adjust the EV for a first object in a first frame, for a second object in a second frame, and for the remaining objects in a third frame.
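
As a sketch contrasting the two behaviors (ev_for_object is a hypothetical callable, e.g., backed by an AEC instance; none of these names are from the disclosure itself):

    def traditional_bracket(base_ev, step=1.0):
        # Fixed offsets around the base exposure, blind to content.
        return [base_ev - step, base_ev, base_ev + step]

    def object_based_bracket(objects, ev_for_object, base_ev):
        # One frame exposed for each of the first two priority
        # objects, plus one frame at the base EV for the remainder.
        return [ev_for_object(obj) for obj in objects[:2]] + [base_ev]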

In some examples, an AEC instance may be associated with one or more objects of an image. In such examples, the AEC instance may determine to update one or more settings of a content capture device 1210 for the one or more objects. In some examples, each instance of AEC may be associated with a different image and a different set of one or more objects such that an update from an instance would be customized for the one or more objects of the instance. In such examples, images captured using different updates (each update for a different set of one or more objects) may be combined to create an image that is individually and separately customized for different objects.

In some examples, the image stitching process 1200 may include receiving three images (e.g., image n (1212), image n+1 (1214), and image n+2 (1216)) from the content capture device 1210. The three images may be received at once or sequentially. In addition, it should be recognized that more or fewer than three images may be received.

In some examples, the three images may be received by an image controller 1220. The image controller 1220 may determine to which AEC instance to send an image. In some examples, the image controller 1220 may send images to AEC instances in a loop as images are received. For example, a loop may cause a first image (e.g., image n (1212)) to be sent to a first AEC instance 1230, a second image (e.g., image n+1 (1214)) to be sent to a second AEC instance 1240, and a third image (e.g., image n+2 (1216)) to be sent to a third AEC instance 1250. Using an order of the loop, a fourth image may be sent to the first AEC instance 1230 (similar to the first image), a fifth image may be sent to the second AEC instance 1240 (similar to the second image), and a sixth image may be sent to the third AEC instance 1250 (similar to the third image).
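
As a minimal sketch of such a loop, assuming each AEC instance is a callable that consumes an image and returns updated settings (the class and method names are illustrative):

    class ImageController:
        def __init__(self, aec_instances):
            self.aec_instances = aec_instances
            self.count = 0

        def on_image(self, image):
            # Round-robin: image i goes to instance i mod the number
            # of instances, matching the loop described above.
            i = self.count % len(self.aec_instances)
            self.count += 1
            return self.aec_instances[i](image)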

It should be recognized that objects in an image may be divided into larger or smaller groups such that more or fewer images are included in one complete loop. For example, a loop of size two would cause a third image to be sent to a first AEC instance rather than a third AEC instance. In some examples, the images of the image stitching process 1200 may be associated with approximately the same scene such that the images may be easily combined together.

In some examples, the first AEC instance 1230 may determine one or more first settings 1232 (e.g., an exposure setting, a gain setting, or any combination thereof) for one or more first objects from the first image. The one or more first settings 1232 may be sent to the content capture device 1210 to be used for a future image. In some examples, the future image may be the next image. In other examples, the future image may be based upon the number of AEC instances. For example, if there are three AEC instances, the future image may be three images in the future.

In some examples, the second AEC instance 1240 may determine one or more second settings 1242 (e.g., an exposure setting, a gain setting, or any combination thereof) for one or more second objects from the second image. In such examples, the second image may be captured before or after the first image. The one or more second settings 1242 may be sent to the content capture device 1210 to be used for a future image, as described above for the first AEC instance.

In some examples, the third AEC instance 1250 may determine one or more third settings 1252 (e.g., an exposure setting, a gain setting, or any combination thereof) for one or more third objects from the third image. In such examples, the third image may be captured before or after the second image. The one or more third settings 1252 may be sent to the content capture device 1210 to be used for a future image, as described above for the first AEC instance.

In some examples, after the adjustments to the content capture device 1210 from the AEC instances, the content capture device 1210 may continue to capture images to be sent to the image controller 1220. In such examples, as the adjustments are made, the new images may be optimized for particular objects, thereby incrementally improving exposure for images.

In some examples, the image controller 1220 may also send the images to an image stitcher 1202. The image stitcher 1202 may combine the images based upon the identified objects. For example, the image stitcher 1202 may receive a first image that had been optimized for one or more first objects, a second image that had been optimized for one or more second objects, and a third image that had been optimized for other portions not included in the first and second objects and/or the third image as a whole. In such an example, the image stitcher 1202 may use portions of the first image that are associated with the one or more first objects, portions of the second image that are associated with the one or more second objects, and portions of the third image to create the output image 1204.

In some examples, the image stitcher 1202 may wait until it has received a number of images corresponding to the number of AEC instances before stitching images together. For example, if there are three AEC instances, the image stitcher 1202 may wait until it receives three images before combining (or stitching) the images.

In some examples, the image stitcher 1202 may be aware of the identified objects and their relative priorities. In such examples, the image stitcher 1202 may generate the output image 1204 based upon the identified objects and their relative priorities. For example, the image stitcher 1202 may generate lower priority portions first, and then generate higher priority portions on top of the lower priority portions. In other words, images adjusted for higher priority objects are overlaid on top of images adjusted for lower priority objects. In some examples, the image stitcher 1202 may be programmed to operate on the 3 images with the assigned priorities. As AEC settles for each of the 3 images, the image stitcher 1202 may produce a progressively better stitched image. The stitched image may be used in a variety of ways, including to display to a user (e.g., in a viewfinder of the content capture device or on a remote display), as input to an object identification system such that objects are identified in the stitched image rather than an un-optimized image, or to store in memory (local or remote from the content capture device) for later use.
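
As a sketch of the priority overlay, assuming aligned frames and boolean masks marking each frame's optimized region, ordered from lowest to highest priority (all names illustrative):

    import numpy as np

    def stitch_by_priority(images, masks):
        # The lowest-priority frame forms the base; each higher-
        # priority frame then overwrites only its masked region.
        output = images[0].copy()
        for img, mask in zip(images[1:], masks[1:]):
            output[mask] = img[mask]
        return output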

According to an embodiment of the present invention, a method is provided. The method includes receiving a first image captured by a content capture device, identifying a first object in the first image, and determining a first update to a first setting of the content capture device, wherein the first update is determined for the first object. The method also includes receiving a second image captured by the content capture device, wherein the second image is captured after the first image, identifying a second object in the second image, and determining a second update to a second setting of the content capture device, wherein the second update is determined for the second object. The method further includes performing the first update to the first setting of the content capture device, receiving a third image captured by the content capture device, wherein the third image is captured after the first update is performed, and performing the second update to the second setting of the content capture device. The method additionally includes receiving a fourth image captured by the content capture device, wherein the fourth image is captured after the second update is performed, and combining the third image and the fourth image into a single image.

As an example, the first setting and the second setting can be associated with exposure or gain. The first setting can be the second setting. Moreover, the third image and the fourth image can be combined using an image stitcher. The first update can be determined using a neural network. In an embodiment, the first update is different than the second update. For instance, the first image and the second image can have the same field of view.

FIG. 12B illustrates another example of an image stitching process that may use multiple instances of automatic exposure control. The process illustrated in FIG. 12B shares some similarities with that illustrated in FIG. 12A, and the description provided in relation to FIG. 12A is applicable to FIG. 12B as appropriate. Referring to FIG. 12B, a content capture device 1210 (referenced as a camera in FIG. 12B for purposes of clarity) is used to capture a first image (image N 1261).

The image N 1261 is provided to image controller 1220. The image controller 1220 identifies up to a predetermined number of priority objects in the image N, wherein the predetermined number is greater than or equal to two. As an example, 2-5 priority objects can be identified in an embodiment. The example illustrated in FIG. 12B uses three priority objects 1267, 1268, and 1269 in composite image 1266. Image 1266 may have 3 priority objects because: 1) that is the maximum allowed (i.e., the predetermined number of priority objects is 3, so even if more than 3 potential priority objects were identified, only the top 3 options were chosen); 2) only 3 objects that could be considered priority objects were identified in the image, even though the maximum (for example, a predetermined number of 5) was not reached; or 3) the system will always return the predetermined number of priority objects regardless of what is contained in the scene (i.e., the system will force a selection of 3). Although three priority objects are illustrated in FIG. 12B, it will be evident to one of skill in the art that a smaller number (e.g., 2) or a greater number (e.g., 4 or more) can be utilized in embodiments of the present invention. In some examples, a set of priority objects is selected for use from a larger set of priority objects identified in the first image. To reduce system complexity and computational burden, the number of priority objects may be limited to a given number, for example, a maximum of five priority objects.
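
As a sketch of capping the selection, assuming each candidate carries a sortable priority score (the pair-based representation is an assumption for illustration):

    def select_priority_objects(candidates, max_objects=3):
        # Keep at most max_objects candidates, highest priority first;
        # fewer may be returned if fewer were identified.
        ranked = sorted(candidates, key=lambda pair: pair[1], reverse=True)
        return [obj for obj, _ in ranked[:max_objects]]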

The image controller 1220 sends the image to multiple AEC instances 1230, 1240, and 1250, where the number of AEC instances is equal to the number of priority objects identified. In some examples, the image controller 1220 may send the image N to the multiple AEC instances concurrently or in a serial manner.

The AEC instances are used to determine, for each of the predetermined number of priority objects, updates for one or more settings of the content capture device. As illustrated in FIG. 12B, the image N 1261 can be analyzed using the three different AEC instances 1230, 1240, and 1250 to determine the updates for the one or more settings, with each of the updates customized to one of the priority objects. Accordingly, the AEC instances are used to determine settings for the content capture device, with each AEC instance performing functions that are customized to one of the particular priority objects. The settings may be an exposure setting, a gain setting, or any combination thereof.

An iterative process is then performed, with the number of iterations being equal to the predetermined number. As illustrated in FIG. 12B, a set of images N+1, N+2, and N+3 are captured as the content capture device is updated using each of the predetermined number of updates (i.e., three updates). The first image is customized to the first priority object, providing optimized exposure settings for the first priority object in an embodiment. In an iterative manner, the content capture device is updated using the next update for one or more settings of the content capture device and a next image is acquired. This process continues for each of the updates until the predetermined number of images have been captured. Thus, in FIG. 12B, each of the set of images N+1, N+2, and N+3 is captured using one or more settings customized (e.g., optimized) to the respective priority objects 1267, 1268, and 1269. Of course, more than three settings and images can be captured.
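
As a minimal sketch of this iteration, assuming a camera object with hypothetical apply_settings() and capture() methods (not an API from the disclosure):

    def capture_per_object(camera, updates):
        # One iteration per priority object: apply that object's
        # settings update, then capture a frame with it.
        images = []
        for update in updates:
            camera.apply_settings(update)
            images.append(camera.capture())
        return images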

As will be evident to one of skill in the art, each of the set of images 1265 will include a portion that is associated with each of the predetermined number of priority objects. As an example, image N+1 could be associated with priority object 1267, image N+2 could be associated with priority object 1268, and image N+3 could be associated with priority object 1269. The set of images, each captured using settings customized (e.g., optimized) to a particular priority object, are provided to image stitcher 1260. The image stitcher 1260 is used to stitch the predetermined number of images (i.e., the set of images 1265) together into a composite image 1266. As illustrated in FIG. 12B, the set of images 1265 can be used to extract the portion of each image that is associated with one of the priority objects, and these portions can then be stitched together to form composite image 1266, which combines the portions into a single image. Since one portion of each of the set of images is optimized for one of the priority objects, the single image will include optimized images of each priority object. The composite image can then be displayed to the user.

Moreover, in some implementations, the functionality of the image controller 1220 and the image stitcher 1260 can be combined in a single master image processing unit.

FIG. 12C is a flowchart illustrating an embodiment of a process for using multiple instances of automatic exposure control. The method 1270 includes receiving an image from a content capture device (1272) and identifying a predetermined number (n) of priority objects within the image (1274). In an embodiment, the identified priority objects can be the top n priority objects within the image. Although three priority objects are illustrated in FIG. 12B, embodiments of the present invention are not limited to three, and n can be larger than or less than three. The method also includes determining settings of the content capture device for priority object 1 (1276), determining settings of the content capture device for priority object 2 (1278), and determining settings of the content capture device for priority object n (1280). As an example, the settings, also referred to as one or more settings, can be optimized settings in the sense that they provide the exposure values that result in desirable (e.g., best) exposure of the priority object of interest. As illustrated in FIG. 12B, the AEC instances 1230, 1240, and 1250 can be utilized to provide these settings, which can include an exposure setting, a gain setting, or any combination thereof.

The one or more first settings may be sent to the content capture device in order to adjust the settings. Accordingly, the settings for the content capture device are adjusted for priority object 1 (1282), priority object 2 (1284), and priority object n (1286). Using the set of updated settings, a set of images are captured using the content capture device. An image that is adapted for priority object 1 is captured (1288), an image that is adapted for priority object 2 is captured (1290), and an image that is adapted for priority object n is captured (1292). Because these new images are optimized for particular priority objects, the exposure for each of these images is improved in relation to each priority object.

The method also includes stitching portions of the set of images together to form a composite image that is adapted for priority objects 1 through n (1294). Thus, the image stitcher 1260 illustrated in FIG. 12B may use a portion of the first image that is associated with the first priority object, a portion of the second image that is associated with the second priority object, and a portion of the third image that is associated with the third priority object to create the single image 1266. The description related to image stitching in FIG. 12A is applicable to the method described in relation to FIG. 12C as appropriate.

Thus, using a single image received at block 1272 that includes multiple priority objects, one or more AEC instances can be used to determine updated (e.g., optimized) settings that are specific to each of the priority objects identified in the single image. Then, using the updated settings, a set of images can be captured, with each image in the set being customized (e.g., optimized) for one of the priority objects. Stitching together portions of each image in the set, where the portions are associated with one of the priority objects, enables the production of a stitched image that can be displayed to the user (1296), in which the exposure settings vary throughout the composite image and are customized to the priority objects.

FIG. 13 illustrates an example of an image stream that may be used with an image stitching process (e.g., the image stitching process 1200). The image stream illustrates a plurality of images that may be received by an image controller (e.g., the image controller 1220). In the example of FIG. 13, an object in an image with dotted lines indicates that an AEC instance does not concentrate on the object. On the other hand, an object in an image with solid lines indicates that an AEC instance is performed based upon the object.

The images in FIG. 13 are separated according to which images are sent to which AEC instance. For example, image 1 (1332) is sent to a first AEC instance 1330. The image 1 (1332) is illustrated as having a first object 1362 with solid lines, which indicates that the first AEC instance 1330 is performed based upon the first object 1362. The image 1 (1332) also has a second object 1364 and a third object 1366. The second object 1364 and the third object 1366 are illustrated as having dotted lines, indicating that the first AEC instance 1330 treats both objects as not being identified.

The first AEC instance 1330 may determine an adjustment for one or more settings (e.g., an exposure setting, a gain setting, or any combination thereof) of a content capture device (e.g., the content capture device 1210). The adjustment may be sent to the content capture device 1210 such that the content capture device 1210 adjusts the one or more settings of the content capture device 1210. In some examples, the one or more settings may be sent to the content capture device 1210 right before the content capture device 1210 is to capture an image. In such examples, upon receipt of the one or more settings, the content capture device 1210 may adjust its settings. In other examples, the content capture device 1210 may be configured to receive the one or more settings and wait until it is time to capture an image with the one or more settings (such as based upon a loop as described above).

Image 2 (1368) may be sent to a second AEC instance 1340. The second AEC instance 1340 may be performed based upon an object corresponding to the second object 1364. Image 3 (1370) may be sent to a third AEC instance 1350. The third AEC instance 1350 may be performed based upon an object corresponding to the third object 1366. The second and the third AEC instances (1340, 1350) may also determine and send one or more settings to the content capture device 1210 (similar to as described above for the first AEC instance 1330).

Image 1 (1360), image 2 (1368), and image 3 (1370) may represent a loop. After one or more settings are set due to an AEC instance, an additional image may be captured with the one or more settings. After the additional image is captured, the image controller may determine which AEC instance the one or more settings were set from. The image controller may then send the additional image to the determined AEC instance. For example, the first AEC instance 1330 may set one or more first settings based upon image 1 (1360). When the image controller receives image 4 (1372), which may be captured using the one or more first settings, the image controller may send the image 4 (1372) to the first AEC instance 1330. Similarly, image 5 (1374) may be sent to the second AEC instance 1340 and image 6 (1376) may be sent to the third AEC instance 1350.

The images 4, 5, and 6 (1372, 1374, 1376) may represent a second loop. This process may continue until images n, n+1, and n+2 (1378, 1380, 1382) are received. A similar process may occur, including determining which AEC instance is associated with the new images, and sending the new images to the AEC instance that generated the one or more settings used to capture the new images. For example, the image n (1378) may be sent to the first AEC instance 1330, the image n+1 (1380) may be sent to the second AEC instance 1340, and the image n+2 (1382) may be sent to the third AEC instance 1350.

FIG. 14 is a flowchart illustrating an embodiment of a process 1400 for automatic exposure control using a third weighting model. In some aspects, the process 1400 may be performed by a computing device.

Process 1400 is illustrated as a logical flow diagram, the operations of which represent a sequence of operations that may be implemented in hardware, computer instructions, or a combination thereof. In the context of computer instructions, the operations represent computer-executable instructions stored on one or more computer-readable storage media that, when executed by one or more processors, perform the recited operations. Generally, computer-executable instructions include routines, programs, objects, components, data structures, and the like that perform particular functions or implement particular data types. The order in which the operations are described is not intended to be construed as a limitation, and any number of the described operations may be combined in any order and/or in parallel to implement the processes.

Additionally, the process 1400 may be performed under the control of one or more computer systems configured with executable instructions and may be implemented as code (e.g., executable instructions, one or more computer programs, or one or more applications) executing collectively on one or more processors, by hardware, or combinations thereof. As noted above, the code may be stored on a machine-readable storage medium, for example, in the form of a computer program comprising a plurality of instructions executable by one or more processors. The machine-readable storage medium may be non-transitory.

The process 1400 may include receiving a first image captured by a content capture device (1405). In some examples, the content capture device may be in a high dynamic range (HDR) mode. The HDR mode may cause a plurality of pictures to be taken close in time with varying settings (e.g., an exposure setting, a gain setting, or any combination thereof).

The process 1400 may further include identifying a first object in the first image (1410). In some examples, the first object may be identified using an object identification system. In such examples, the object identification system may identify a type of the first object (e.g., a person, an animal, a building, or the like). In other examples, the object identification system may identify one or more attributes or characteristics of the first object, more than just a type of the first object (e.g., that the first object is Bob Marley). In some examples, the object identification system may identify whether the first object is being processed by another system (such as a mixed reality system that is determining where to place a virtual object).

The process 1400 may further include determining a first update to a first setting of the content capture device (1415). In some examples, the first update may be determined for the first object. In such examples, the first update may be determined using any of the weighting models described herein, including the first weighting model and/or the second weighting model. In some examples, the first setting may be an exposure setting, a gain setting, or any combination thereof. In such examples, the exposure setting may be a shutter speed, an ISO speed, or any combination thereof. The gain setting may be digital gain, analog gain, or any combination thereof.

The process 1400 may further include receiving a second image captured by the content capture device (1420). In some examples, the second image may be captured before or after the first image. In some examples, the second image may be captured according to the HDR mode.

The process 1400 may further include identifying a second object in the second image (1425). In some examples, the second object may be identified using the object identification system described above.

The process 1400 may further include determining a second update to a second setting of the content capture device (1430). In some examples, the second update may be determined for the second object. In some examples, the second update may be determined using any of the weighting models described herein, including the first weighting model and/or the second weighting model. In such examples, the weighting model used for the second update may be the same as or different from the weighting model used for the first update.

The process 1400 may further include performing the first update to the first setting of the content capture device (1435). Performing the first update may include sending one or more instructions to the content capture device that cause the content capture device to change the first setting. In some examples, the one or more instructions may be sent such that the first setting is updated at a point in the future (e.g., when an image is about to be taken that should be using the first setting). In other examples, the one or more instructions may be sent after an image is captured (or received) from the content capture device, the capture indicating that a new image is about to be captured that may use the first setting.

The process 1400 may further include receiving a third image captured by the content capture device (1440). In some examples, the third image may be captured after the first update is performed. In such examples, the first update may be applied to the content capture device such that the third image is captured with an exposure based upon the first update.

The process 1400 may further include performing the second update to the second setting of the content capture device (1445). Performing the second update may include sending one or more instructions to the content capture device that cause the content capture device to change the second setting. In some examples, the one or more instructions may be sent such that the second setting is updated at a point in the future (e.g., when an image is about to be taken that should be using the second setting). In other examples, the one or more instructions may be sent after an image is captured (or received) from the content capture device, the capture indicating that a new image is about to be captured that may use the second setting.

The process 1400 may further include receiving a fourth image captured by the content capture device (1450). In some examples, the fourth image may be captured after the second update is performed. In such examples, the second update may be applied to the content capture device such that the fourth image is captured with an exposure based upon the second update.

The process 1400 may further include combining the third image and the fourth image into a single image for viewing (1455). In some examples, combining (e.g., stitching) may include taking a portion of the third image and a portion of the fourth image. For example, the third image may correspond to the first image, which was optimized for the first object. In such an example, the portion of the third image that is taken may be associated with the first object such that combining may include taking the portion of the third image that is associated with the first object. Similarly, the fourth image may correspond to the second image, which was optimized for the second object. Thus, the portion of the fourth image that is taken may be associated with the second object such that combining may include taking the portion of the fourth image that is associated with the second object. After taking the portions of the third and fourth images that are associated with the first and second objects respectively, other portions may be averaged between the third image and the fourth image. The other portions may not be associated with either the first object or the second object such that the other portions were not optimized by the first update and/or the second update. However, it should be recognized that images may be stitched (or combined) in a variety of ways.
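
As a sketch of this particular combination, assuming aligned frames and boolean masks for the first and second objects' regions (names illustrative):

    import numpy as np

    def combine_pair(third_img, first_mask, fourth_img, second_mask):
        third = third_img.astype(float)
        fourth = fourth_img.astype(float)
        # Unoptimized regions: average the two frames.
        out = (third + fourth) / 2.0
        # Object regions: take each from the frame optimized for it.
        out[first_mask] = third[first_mask]
        out[second_mask] = fourth[second_mask]
        return out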

In some examples, the process 1400 may include a first cycle and a second cycle. The first cycle may include the first image and the second image. The second cycle may include the third image and the fourth image. In some examples, each new cycle may include two more images such that the operations continue to be performed on each new cycle. In some examples, each new cycle may be used to update settings of a content capture device (similarly to the first image and the second image) as well as be combined (similarly to the third image and the fourth image). For example, determining updates (as described for the first image and the second image) may be performed for the third image and the fourth image.

FIG. 15 illustrates an example of a block diagram for a computer system. In this example, the computer system 1500 includes a monitor 1510, a computer 1520, a keyboard 1530, a user input device 1540, one or more computer interfaces 1550, and the like. In the present example, the user input device 1540 is typically embodied as a computer mouse, a trackball, a track pad, a joystick, a wireless remote, a drawing tablet, a voice command system, an eye tracking system, or the like. The user input device 1540 typically allows a user to select objects, icons, text, and the like that appear on the monitor 1510 via a command such as a click of a button or the like.

Examples of the computer interfaces 1550 typically include an Ethernet card, a modem (telephone, satellite, cable, ISDN), an (asynchronous) digital subscriber line (DSL) unit, a FireWire interface, a USB interface, and the like. For example, the computer interfaces 1550 may be coupled to a computer network 1555, to a FireWire bus, or the like. In other embodiments, the computer interfaces 1550 may be physically integrated on the motherboard of the computer 1520, or may be a software program, such as soft DSL, or the like.

In various examples, the computer 1520 typically includes familiar computer components such as a processor 1560, memory storage devices, such as a random access memory (RAM) 1570 and disk drives 1580, and a system bus 1590 interconnecting the above components.

The RAM 1570 and disk drive 1580 are examples of tangible media configured to store data such as embodiments of the present disclosure, including executable computer code, human readable code, or the like. Other types of tangible media include floppy disks, removable hard disks, optical storage media such as CD-ROMs, DVDs, and bar codes, semiconductor memories such as flash memories, read-only memories (ROMs), battery-backed volatile memories, networked storage devices, and the like.

In various examples, the computer system 1500 may also include software that enables communications over a network using protocols such as HTTP, TCP/IP, RTP/RTSP, and the like. In alternative embodiments of the present disclosure, other communications software and transfer protocols may also be used, for example IPX, UDP, or the like.

The features described herein may be implemented in digital electronic circuitry, or in computer hardware, firmware, software, or in combinations of them. The apparatus may be implemented in a computer program product tangibly embodied in an information carrier, e.g., in a machine-readable storage device, for execution by a programmable processor; and method steps may be performed by a programmable processor executing a program of instructions to perform functions of the described implementations by operating on input data and generating output. The described features may be implemented advantageously in one or more computer programs that are executable on a programmable system including at least one programmable processor coupled to receive data and instructions from, and to transmit data and instructions to, a data storage system, at least one input device, and at least one output device. A computer program is a set of instructions that may be used, directly or indirectly, in a computer to perform a certain activity or bring about a certain result. A computer program may be written in any form of programming language, including compiled or interpreted languages, and it may be deployed in any form, including as a stand-alone program or as a module, component, subroutine, or other unit suitable for use in a computing environment.

Suitable processors for the execution of a program of instructions include, by way of example, both general and special purpose microprocessors, and the sole processor or one of multiple processors of any kind of computer. Generally, a processor will receive instructions and data from a read-only memory or a random access memory or both. The essential elements of a computer are a processor for executing instructions and one or more memories for storing instructions and data. Generally, a computer will also include, or be operatively coupled to communicate with, one or more mass storage devices for storing data files; such devices include magnetic disks, such as internal hard disks and removable disks; magneto-optical disks; and optical disks. Storage devices suitable for tangibly embodying computer program instructions and data include all forms of non-volatile memory, including by way of example semiconductor memory devices, such as EPROM, EEPROM, and flash memory devices; magnetic disks such as internal hard disks and removable disks; magneto-optical disks; and CD-ROM and DVD-ROM disks. The processor and the memory may be supplemented by, or incorporated in, ASICs (application-specific integrated circuits).

To provide for interaction with a user, the features may be implemented on a computer having a display device such as a CRT (cathode ray tube), LCD (liquid crystal display), or LED (light emitting diode) monitor for displaying information to the user, and a keyboard and a pointing device such as a mouse or a trackball by which the user may provide input to the computer.

The features may be implemented in a computer system that includes a back-end component, such as a data server, or that includes a middleware component, such as an application server or an Internet server, or that includes a front-end component, such as a client computer having a graphical user interface or an Internet browser, or any combination of them. The components of the system may be connected by any form or medium of digital data communication such as a communication network. Examples of communication networks include a LAN, a WAN, and the computers and networks forming the Internet.

The computer system may include clients and servers. A client and server are generally remote from each other and typically interact through a network, such as the described one. The relationship of client and server arises by virtue of computer programs running on the respective computers and having a client-server relationship to each other. Although a few implementations have been described in detail above, other modifications are possible.

In addition, the logic flows depicted in the figures do not require the particular order shown, or sequential order, to achieve desirable results. Moreover, other steps may be provided, or steps may be eliminated, from the described flows, and other components may be added to, or removed from, the described systems. Accordingly, other implementations are within the scope of the following claims.

Where components are described as being configured to perform certain operations, such configuration may be accomplished, for example, by designing electronic circuits or other hardware to perform the operation, by programming programmable electronic circuits (e.g., microprocessors or other suitable electronic circuits) to perform the operation, or any combination thereof.

A number of embodiments of the disclosure have been described. Nevertheless, it will be understood that various modifications may be made without departing from the scope of the disclosure.

What is claimed is:
1. A method comprising: receiving an image captured by a content capture device, wherein the image includes a plurality of pixels; identifying a target luma value for the image; identifying an object in the image; identifying one or more attributes of the object; calculating a weight for the object using a neural network, wherein the neural network uses the one or more attributes as input; dividing the plurality of pixels of the image into a plurality of pixel groups; defining a first set of pixel groups not associated with the object; defining a second set of pixel groups associated with the object; for each pixel group of the second set of pixel groups: calculating a pixel group luma value; and multiplying the pixel group luma value by the weight to provide a weighted pixel group luma value; calculating a total luma value for the image, wherein the total luma value includes a summation of the weighted pixel group luma values; computing a difference between the total luma value and the target luma value; and updating a setting of the content capture device based upon the computed difference.
2. The method of claim 1, further comprising: identifying a second object in the image; identifying one or more attributes of the second object; defining a third set of pixel groups that is associated with the second object; calculating a weight for the second object using a second neural network, wherein the second neural network uses the one or more attributes of the second object as input; and for each pixel group of the third set of pixel groups: calculating a second pixel group luma value; and multiplying the second pixel group luma value by the weight for the second object to provide a weighted second pixel group luma value, wherein the total luma value further includes a summation of the weighted second pixel group luma values.
3. The method of claim 1, wherein the neural network is a multilayer perceptron.
4. The method of claim 1, wherein an attribute of the one or more attributes of the object includes an object priority, an object distance from a viewer, an object distance from a reticle, an object distance from an eye gaze, or an object size.
5. The method of claim 1, wherein the target luma value is based upon a field of view.
6. The method of claim 1, wherein the setting of the content capture device is associated with exposure or gain.
7. The method of claim 1, wherein each pixel group of the plurality of pixel groups is the same size.
8. A method comprising: receiving a first image captured by a content capture device; identifying a predetermined number of priority objects in the first image, wherein the predetermined number is greater than or equal to two; determining, for each of the predetermined number of priority objects, one or more updates for one or more settings of the content capture device; iteratively: updating the content capture device using each of the one or more updates; and receiving the predetermined number of images captured by the content capture device using each of the one or more updates; and stitching the predetermined number of images together to form a composite image.
9. The method of claim 8, further comprising displaying the composite image.
10. The method of claim 8, wherein the first image includes a predetermined number of portions, each of the predetermined number of portions associated with one of the predetermined number of priority objects.
11. The method of claim 10, wherein stitching the predetermined number of images together comprises combining the predetermined number of portions.
12. The method of claim 8, wherein settings of the one or more settings are associated with exposure or gain.
13. The method of claim 8, wherein one or more of the one or more updates are determined using a neural network.
14. The method of claim 8, wherein one of the one or more updates is different than another of the one or more updates.
15. The method of claim 8, wherein the first image and the predetermined number of images are in a same field of view.
16. A method comprising: receiving an image captured by a content capture device, wherein the image includes a plurality of pixels; identifying a target luma value for the image; dividing the plurality of pixels of the image into a plurality of pixel groups; calculating a pixel group luma value for each of the plurality of pixel groups; identifying a location in the image, the location corresponding to a point where a user is looking in an environment corresponding to the image; setting weights for each of the plurality of pixel groups based upon the identified location; calculating an image luma value using, for each of the plurality of pixel groups, the weight and the pixel group luma value; computing a difference between the image luma value and the target luma value; and updating a setting of the content capture device based upon the computed difference.
17. The method of claim 16, wherein the location is identified based upon an image of one or more eyes of the user.
18. The method of claim 16, wherein the location is identified based upon a direction of a gaze of the user.
19. A method comprising: receiving an image captured by a content capture device, wherein the image includes a plurality of pixels; identifying a target luma value for the image; dividing the plurality of pixels of the image into a plurality of pixel groups; calculating a pixel group luma value for each of the plurality of pixel groups; receiving a depth map corresponding to the image; setting weights for each of the plurality of pixel groups based upon the depth map; calculating an image luma value using, for each of the plurality of pixel groups, the weight and the pixel group luma value; computing a difference between the image luma value and the target luma value; and updating a setting of the content capture device based upon the computed difference.
20. A method comprising: receiving an image captured by a content capture device, wherein the image includes a plurality of pixels; identifying a target luma value for the image; dividing the plurality of pixels of the image into a plurality of pixel groups; identifying multiple patches in the image, the multiple patches including a first patch and a second patch, wherein the first patch includes one or more pixel groups, and wherein the second patch includes one or more pixel groups different than the one or more pixel groups of the first patch; calculating, using a first model, one or more weights for the first patch; calculating, using a second model, one or more weights for the second patch; for each pixel group: calculating a pixel group luma value; and multiplying the pixel group luma value by the weight to provide a weighted pixel group luma value; calculating a total luma value for the image, wherein the total luma value includes a summation of the weighted pixel group luma values; computing a difference between the total luma value and the target luma value; and updating a setting of the content capture device based upon the computed difference.